Learn Hadoop in 24 Hours (eBook)
350 Seiten
Publishdrive (Verlag)
978-0-00-031745-2 (ISBN)
Hadoop has changed the way large data sets are analyzed, stored, transferred, and processed. At such low cost, it provides benefits like supports partial failure, fault tolerance, consistency, scalability, flexible schema, and so on. It also supports cloud computing. More and more number of individuals are looking forward to mastering their Hadoop skills.
While initiating with Hadoop, most users are unsure about how to proceed with Hadoop. They are not aware of what are the pre-requisite or data structure they should be familiar with. Or How to make the most efficient use of Hadoop and its ecosystem. To help them with all these queries and other issues this e-book is designed.
The book gives insights into many of Hadoop libraries and packages that are not known to many Big data Analysts and Architects. The e-book also tells you about Hadoop MapReduce and HDFS. The example in the e-book is well chosen and demonstrates how to control Hadoop ecosystem through various shell commands. With this book, users will gain expertise in Hadoop technology and its related components. The book leverages you with the best Hadoop content with the lowest price range.
After going through this book, you will also acquire knowledge on Hadoop Security required for Hadoop Certifications like CCAH and CCDH. It is a definite guide to Hadoop.
Table Contents
Chapter 1: What Is Big Data
Examples Of 'Big Data'
Categories Of 'Big Data'
Characteristics Of 'Big Data'
Advantages Of Big Data Processing
Chapter 2: Introduction to Hadoop
Components of Hadoop
Features Of 'Hadoop'
Network Topology In Hadoop
Chapter 3: Hadoop Installation
Chapter 4: HDFS
Read Operation
Write Operation
Access HDFS using JAVA API
Access HDFS Using COMMAND-LINE INTERFACE
Chapter 5: Mapreduce
How MapReduce works
How MapReduce Organizes Work?
Chapter 6: First Program
Understanding MapReducer Code
Explanation of SalesMapper Class
Explanation of SalesCountryReducer Class
Explanation of SalesCountryDriver Class
Chapter 7: Counters & Joins In MapReduce
Two types of counters
MapReduce Join
Chapter 8: MapReduce Hadoop Program To Join Data
Chapter 9: Flume and Sqoop
What is SQOOP in Hadoop?
What is FLUME in Hadoop?
Some Important features of FLUME
Chapter 10: Pig
Introduction to PIG
Create your First PIG Program
PART 1) Pig Installation
PART 2) Pig Demo
Chapter 11: OOZIE
What is OOZIE?
How does OOZIE work?
Example Workflow Diagram
Oozie workflow application
Why use Oozie?
FEATURES OF OOZIE
Erscheint lt. Verlag | 30.10.2021 |
---|---|
Sprache | englisch |
Themenwelt | Mathematik / Informatik ► Informatik ► Datenbanken |
ISBN-10 | 0-00-031745-4 / 0000317454 |
ISBN-13 | 978-0-00-031745-2 / 9780000317452 |
Haben Sie eine Frage zum Produkt? |
Größe: 3,4 MB
Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM
Dateiformat: EPUB (Electronic Publication)
EPUB ist ein offener Standard für eBooks und eignet sich besonders zur Darstellung von Belletristik und Sachbüchern. Der Fließtext wird dynamisch an die Display- und Schriftgröße angepasst. Auch für mobile Lesegeräte ist EPUB daher gut geeignet.
Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine
Geräteliste und zusätzliche Hinweise
Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.
aus dem Bereich