Programming Hive

Data Warehouse and Query Language for Hadoop

Edward Capriolo, Dean Wampler, Jason Rutherglen (Authors)

Book | Softcover

400 pages

2016 | 2nd revised edition
O'Reilly Media (Publisher)
978-1-4919-3445-6 (ISBN)

Unfortunately, this title will not be published

Need to move a relational database application to Hadoop? This example-driven guide introduces you to Apache Hive, Hadoop’s data warehouse infrastructure. You’ll quickly learn how to use Hive’s SQL dialect—HiveQL—to summarize, query, and analyze large datasets stored in Hadoop’s distributed filesystem.

Completely updated for Hive 0.15.0, the second edition of this popular book shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data.

Edward Capriolo is currently System Administrator at Media6degrees, where he helps design and maintain distributed data storage systems for the internet advertising industry. Edward is a member of the Apache Software Foundation and a committer for the Hadoop-Hive project. He has experience as a developer as well as a Linux and network administrator, and enjoys the rich world of open source software.

Dean Wampler, Ph.D. is a Consultant for Typesafe, where he specializes in helping clients succeed with Scala and Functional Programming projects. He works with "Big Data" tools like Hadoop, Spark, and Machine Learning libraries, and Reactive tools like Akka and Play. Dean is an O'Reilly author and a frequent conference speaker and organizer. He has a Ph.D. in Physics from the University of Washington.

Jason Rutherglen works at Datastax as a senior Big Data engineer. His work there involves architecting, developing, and supporting the Datastax Enterprise product line, which includes Solr integrated with Cassandra. His career has involved an array of technologies including search, Hadoop, Hive, mobile phones, cryptography, and natural language processing. Jason has been developing solutions with Lucene and Solr for more than seven years.

Publication date	May 5, 2016
Place of publication	Sebastopol
Language	English
Dimensions	150 x 250 mm
Weight	666 g
Binding	Paperback
Subject area	Computer Science ► Databases ► Data Warehouse / Data Mining
	Computer Science ► Databases ► SQL Language
	Mathematics / Computer Science ► Computer Science ► Software Development
Keywords	Data Warehouse • Hadoop • Hadoop Hive • Hive • SQL
ISBN-10	1-4919-3445-X / 149193445X
ISBN-13	978-1-4919-3445-6 / 9781491934456
Condition	New