Hadoop in Action
2016
2nd edition
Manning Publications (publisher)
978-1-61729-122-7 (ISBN)
- Unfortunately, this title is out of print; there is no new edition
KEY SELLING POINTS
Hadoop from the ground up
Covers the newer and bigger Hadoop landscape
Demystifies many concepts and components surrounding Hadoop
AUDIENCE
This book requires basic Java skills. Knowing basic statistical concepts can help with the more advanced examples.
DESCRIPTION
The massive datasets required for most modern businesses are too large to safely store and efficiently process on a single server. Hadoop is an open source data processing framework that provides a distributed file system that can manage data stored across clusters of servers and implements the MapReduce data processing model so that users can effectively query and utilize big data. The new Hadoop 2.0 is a stable, enterprise-ready platform supported by a rich ecosystem of tools and related technologies such as Pig, Hive, YARN, Spark, Tez, and many more.
Hadoop in Action, Second Edition, provides a comprehensive introduction to Hadoop and shows how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show how Hadoop can be used in more complex data analysis tasks. It covers how YARN, new in Hadoop 2, simplifies and supercharges resource management to make streaming and real-time applications more feasible. Included are best practices and design patterns of MapReduce programming. The book expands on the first edition by enhancing coverage of important Hadoop 2 concepts and systems, and by providing new chapters on data management and data science that reinforce a practical understanding of Hadoop.
ABOUT THE TECHNOLOGY
Hadoop is an open source framework for writing and running distributed applications that process large amounts of data. Distributed computing is a wide and varied field, but the key distinctions of Hadoop are that it is accessible, robust, scalable, and simple, once users have learned a few basic concepts.
Chuck Lam and Mark Davis have been working with Hadoop since its earliest days. Chuck is a serial startup veteran and the original author of Hadoop in Action. Mark founded the Hadoop analytics company Kitenga and is now a Distinguished Big Data Analytics Engineer for Dell and the Big Data Lead for the IEEE Cloud Computing Initiative.
Publication date (per publisher) | 28 June 2016 |
---|---|
Place of publication | New York |
Language | English |
Weight | 1000 g |
Subject areas | Computer Science ► Databases ► Data Warehouse / Data Mining |
 | Mathematics / Computer Science ► Computer Science ► Networks |
 | Mathematics / Computer Science ► Computer Science ► Software Development |
 | Mathematics / Computer Science ► Computer Science ► Theory / Studies |
 | Mathematics / Computer Science ► Computer Science ► Web / Internet |
 | Computer Science ► Other Topics ► Hardware |
ISBN-10 | 1-61729-122-6 / 1617291226 |
ISBN-13 | 978-1-61729-122-7 / 9781617291227 |
Condition | New |