Complete Guide to Open Source Big Data Stack

Michael Frampton (Autor)

Buch | Softcover

365 Seiten

2018
Apress (Verlag)
978-1-4842-2148-8 (ISBN)

Artikel merken

Describes the step-by-step construction of a real-world big data stack from open source software
Explains popular Apache-based systems such as Hadoop, HBase, Cassandra, Riak, Brooklyn, Spark, Kafka, and more
Author builds a data stack for this book and then recounts the process, including successes, failures, and lessons learned

See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together.

In the Complete Guide to Open Source Big Data Stack, the author begins by creating a private cloud and then installs and examines Apache Brooklyn. After that, he uses each chapter to introduce one piece of the big data stack—sharing how to source the software and how to install it. You learn by simple example, step by step and chapter by chapter, as a real big data stack is created. The book concentrates on Apache-based systems and shares detailed examples of cloud storage, release management, resource management, processing, queuing, frameworks, data visualization, and more.

What You’ll Learn

Install a private cloud onto the local cluster using Apache cloud stack
Source, install, and configure Apache: Brooklyn, Mesos, Kafka, and Zeppelin
See how Brooklyn can be used to install Mule ESB on a cluster and Cassandra in the cloud
Install and use DCOS for big data processing
Use Apache Spark for big data stack data processing

This book is for developers, architects, IT project managers, database administrators, and others charged with developing or supporting a big data system. It is also for anyone interested in Hadoop or big data, and those experiencing problems with data size.

Mike Frampton has been in the IT industry since 1990, working in many roles (tester, developer, support, QA), and in many sectors (telecoms, banking, energy, insurance). He has also worked for major corporations and banks as a contractor and a permanent member of staff, including Agilent, BT, IBM, HP, Reuters, and JP Morgan Chase. The owner of Semtech Solutions, an IT/Big Data consultancy, Mike currently lives by the beach in Paraparaumu, New Zealand, with his wife and son. Mike has a keen interest in new IT-based technologies and the way that technologies integrate. Being married to a Thai national, Mike divides his time between Paraparaumu or Wellington in New Zealand and their house in Roi Et, Thailand.

Chapter 1: The Big Data Stack Overview
Chapter 2: Cloud Storage
Chapter 3: Apache Brooklyn
Chapter 4: Apache Mesos
Chapter 5: Stack Storage Options
Chapter 6: Processing
Chapter 7: Streaming
Chapter 8: Frameworks
Chapter 9: Visualization
Chapter 10: The Big Data Stack

Erscheinungsdatum	26.01.2018
Zusatzinfo	131 Illustrations, color; 36 Illustrations, black and white; XX, 365 p. 167 illus., 131 illus. in color.
Verlagsort	Berkley
Sprache	englisch
Maße	178 x 254 mm
Gewicht	736 g
Einbandart	kartoniert
Themenwelt	Mathematik / Informatik ► Informatik ► Datenbanken
	Mathematik / Informatik ► Informatik ► Netzwerke
	Mathematik / Informatik ► Informatik ► Software Entwicklung
	Informatik ► Theorie / Studium ► Algorithmen
Schlagworte	Apache Brooklyn • Apache cloud stack • Apache Kafka • Apache Spark • Apache Zeppelin • Big Data • Big data stack • Cassandra • Data Visualization • Hadoop • Open Source • Open Source Software • riak
ISBN-10	1-4842-2148-6 / 1484221486
ISBN-13	978-1-4842-2148-8 / 9781484221488
Zustand	Neuware