Big Data Smack

A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka

Raul Estrada, Isaac Ruiz (Autoren)

Buch | Softcover

289 Seiten

2016
Apress (Verlag)
978-1-4842-2174-7 (ISBN)

Artikel merken

The SMACK stack is relatively new, and there are no books about it currently on the market
Everybody wants to learn about how to incorporate big data, but there is a lack of practical guides
This book covers the full stack of big data architecture, discussing the practical benefits of each technology

This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology—Scala/Spark, Mesos, Akka, Cassandra, and Kafka—in every layer. Big data architecture is becoming a requirement for many different enterprises.

So far, however, the focus has largely been on collecting, aggregating, and crunching large datasets in a timely manner. In many cases now, organizations need more than one paradigm to perform efficient analyses.

Big Data SMACK explains each of the full-stack technologies and, more importantly, how to best integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation.

The book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by every technology. It covers the six main concepts of big data architecture and how integrate, replace, and reinforce every layer:

The language: Scala
The engine: Spark (SQL, MLib, Streaming, GraphX)
The container: Mesos, Docker
The view: Akka
The storage: Cassandra
The message broker: Kafka

What you’ll learn
How to make big data architecture without using complex Greek letter architectures.
How to build a cheap but effective cluster infrastructure.
How to make queries, reports, and graphs that business demands.
How to manage and exploit unstructured and No-SQL data sources.
How use tools to monitor the performance of your architecture.
How to integrate all technologies and decide which replace and which reinforce.

This book is for developers, data architects, and data scientists looking for how to integrate the most successful big data open stack architecture and how to choose the correct technology in every layer.

Raul Estrada is the co-founder of Treu Technologies, an enterprise for Social Data Marketing and BigData research. Estrada is an Enterprise Architect with more than 15 years of experience in cluster management and Enterprise Software. Prior to founding Treu Technologies, Estrada worked as an Enterprise Architect in Application Servers & evangelist for Oracle Inc. Estrada loves functional languages like Elixir and Scala, and also has a Master degree on Computer Science.

Isaac Ruiz is a Java programmer since 2001, and a consultant and architect since 2003. Ruiz had participated in projects of different areas and varied scopes (education, communications, retail, and others). Ruiz specializes in systems integration and has participated in projects mainly related to the financial sector. Ruiz is a supporter of free software. Ruiz like to experiment with new technologies (frameworks, languages, methods).

Part 1. Introduction Chapter 1. Big Data, Big Problems Chapter 2. Big Data, Big Solutions Part 2. Playing SMACK Chapter 3. The Language: Scala Chapter 4. The Model: Akka Chapter 5. Storage. Apache Cassandra Chapter 6. The View Chapter 7. The Manager: Apache Mesos Chapter 8. The Broker: Apache Kafka Part 3. Improving SMACK Chapter 9. Fast Data Patterns Chapter 10. Big Data Pipelines Chapter 11. Glossary.

Erscheinungsdatum	12.10.2016
Zusatzinfo	22 black & white illustrations, 52 colour illustrations, biography
Verlagsort	Berkley
Sprache	englisch
Maße	178 x 254 mm
Gewicht	559 g
Einbandart	kartoniert
Themenwelt	Informatik ► Datenbanken ► Data Warehouse / Data Mining
	Mathematik / Informatik ► Informatik ► Netzwerke
	Mathematik / Informatik ► Informatik ► Software Entwicklung
	Informatik ► Theorie / Studium ► Algorithmen
Schlagworte	Akka • Apache Cassandra • Apache Kafka • Apache Mesos • Apache Spark • Big Data • Docker • Hadoop • No-SQL databases • Scala
ISBN-10	1-4842-2174-5 / 1484221745
ISBN-13	978-1-4842-2174-7 / 9781484221747
Zustand	Neuware