Programming Hive - Edward Capriolo, Dean Wampler, Jason Rutherglen

Programming Hive

Data Warehouse and Query Language for Hadoop
Buch | Softcover
400 Seiten
2016 | 2., überarbeitete Auflage
O'Reilly Media (Verlag)
978-1-4919-3445-6 (ISBN)
29,70 inkl. MwSt
  • Titel wird leider nicht erscheinen
  • Artikel merken
Need to move a relational database application to Hadoop? This example-driven guide introduces you to Apache Hive, Hadoop's data warehouse infrastructure.
Need to move a relational database application to Hadoop? This example-driven guide introduces you to Apache Hive, Hadoop’s data warehouse infrastructure. You’ll quickly learn how to use Hive’s SQL dialect—HiveQL—to summarize, query, and analyze large datasets stored in Hadoop’s distributed filesystem.

Completely updated for Hive 0.15.0, the second edition of this popular book shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data.

Edward Capriolo is currently System Administrator at Media6degrees where he helps design and maintain distributed data storage systems for the internet advertising industry. Edward is a member of the Apache Software Foundation and a committer for the Hadoop-Hive project. He has experience as a developer as well Linux and network administrator and enjoys the rich world of open source software.

Dean Wampler, Ph.D. is a Consultant for Typesafe, where he specializes in helping clients succeed with Scala and Functional Programming projects. He works with "Big Data" tools like Hadoop, Spark, and Machine Learning libraries, and Reactive tools like Akka and Play. Dean is an O'Reilly author and a frequent conference speaker and organizer. He has a Ph.D. in Physics from the University of Washington.

Jason Rutherglen works at Datastax as a senior Big Data engineer. His work there involves architecting, developing, and supporting the Datastax Enterprise product line which includes Solr integrated with Cassandra. His career has involved an array of technologies including search, Hadoop, Hive, mobile phones, cryptography, and natural language processing. Jason has been developing solutions with Lucene and Solr for more than 7 years.

Erscheinungsdatum
Verlagsort Sebastopol
Sprache englisch
Maße 150 x 250 mm
Gewicht 666 g
Einbandart kartoniert
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Informatik Datenbanken SQL Language
Mathematik / Informatik Informatik Software Entwicklung
Schlagworte Data Warehouse • Hadoop • Hadoop Hive • Hive • SQL
ISBN-10 1-4919-3445-X / 149193445X
ISBN-13 978-1-4919-3445-6 / 9781491934456
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Datenanalyse für Künstliche Intelligenz

von Jürgen Cleve; Uwe Lämmel

Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
74,95
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
44,90