Complete Guide to Open Source Big Data Stack
Apress (publisher)
978-1-4842-2148-8 (ISBN)
- Describes the step-by-step construction of a real-world big data stack from open source software
- Explains popular Apache-based systems such as Hadoop, HBase, Cassandra, Riak, Brooklyn, Spark, Kafka, and more
- Author builds a data stack for this book and then recounts the process, including successes, failures, and lessons learned
See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together.
In the Complete Guide to Open Source Big Data Stack, the author begins by creating a private cloud and then installs and examines Apache Brooklyn. After that, he uses each chapter to introduce one piece of the big data stack—sharing how to source the software and how to install it. You learn by simple example, step by step and chapter by chapter, as a real big data stack is created. The book concentrates on Apache-based systems and shares detailed examples of cloud storage, release management, resource management, processing, queuing, frameworks, data visualization, and more.
What You’ll Learn
- Install a private cloud onto the local cluster using Apache CloudStack
- Source, install, and configure Apache Brooklyn, Mesos, Kafka, and Zeppelin
- See how Brooklyn can be used to install Mule ESB on a cluster and Cassandra in the cloud
- Install and use DCOS for big data processing
- Use Apache Spark for big data stack data processing
This book is for developers, architects, IT project managers, database administrators, and others charged with developing or supporting a big data system. It is also for anyone interested in Hadoop or big data, and those experiencing problems with data size.
Mike Frampton has been in the IT industry since 1990, working in many roles (tester, developer, support, QA), and in many sectors (telecoms, banking, energy, insurance). He has also worked for major corporations and banks as a contractor and a permanent member of staff, including Agilent, BT, IBM, HP, Reuters, and JP Morgan Chase. The owner of Semtech Solutions, an IT/Big Data consultancy, Mike currently lives by the beach in Paraparaumu, New Zealand, with his wife and son. Mike has a keen interest in new IT-based technologies and the way that technologies integrate. Being married to a Thai national, Mike divides his time between Paraparaumu or Wellington in New Zealand and their house in Roi Et, Thailand.
Chapter 1: The Big Data Stack Overview
Chapter 2: Cloud Storage
Chapter 3: Apache Brooklyn
Chapter 4: Apache Mesos
Chapter 5: Stack Storage Options
Chapter 6: Processing
Chapter 7: Streaming
Chapter 8: Frameworks
Chapter 9: Visualization
Chapter 10: The Big Data Stack
Publication date | 26.01.2018 |
---|---|
Additional info | 131 Illustrations, color; 36 Illustrations, black and white; XX, 365 p. 167 illus., 131 illus. in color. |
Place of publication | Berkeley |
Language | English |
Dimensions | 178 x 254 mm |
Weight | 736 g |
Binding | Paperback |
Subject area | Mathematics / Computer Science ► Computer Science ► Databases |
 | Mathematics / Computer Science ► Computer Science ► Networks |
 | Mathematics / Computer Science ► Computer Science ► Software Development |
 | Computer Science ► Theory / Studies ► Algorithms |
Keywords | Apache Brooklyn • Apache CloudStack • Apache Kafka • Apache Spark • Apache Zeppelin • Big Data • Big data stack • Cassandra • Data Visualization • Hadoop • Open Source • Open Source Software • Riak |
ISBN-10 | 1-4842-2148-6 / 1484221486 |
ISBN-13 | 978-1-4842-2148-8 / 9781484221488 |
Condition | New |