Data Analysis with Python and PySpark - Jonathan Rioux

Data Analysis with Python and PySpark

(Autor)

Buch | Softcover
425 Seiten
2022
Manning Publications (Verlag)
978-1-61729-720-5 (ISBN)
64,70 inkl. MwSt
When it comes to data analytics, it pays tothink big. PySpark blends the powerful Spark big data processing engine withthe Python programming language to provide a data analysis platform that can scaleup for nearly any task. Data Analysis with Python and PySpark is yourguide to delivering successful Python-driven data projects.

Data Analysis with Python and PySpark is a carefully engineered tutorial that helps you use PySpark to deliver your data-driven applications at any scale. This clear and hands-on guide shows you how to enlarge your processing capabilities across multiple machines with data from any source, ranging from Had oop-based clusters to Excel worksheets. You'll learn how to break down big analysis tasks into manageable chunks and how to choose and use the best PySpark data abstraction for your unique needs.

The Spark data processing engine is an amazing analytics factory: raw data comes in,and insight comes out. Thanks to its ability to handle massive amounts of data distributed across a cluster, Spark has been adopted as standard by organizations both big and small. PySpark, which wraps the core Spark engine with a Python-based API, puts Spark-based data pipelines in the hands of programmers and data scientists working with the Python programming language. PySpark simplifies Spark's steep learning curve, and provides a seamless bridge between Spark and an ecosystem of Python-based data science tools.

As a data scientist for an engineering consultancy Jonathan Rioux uses PySpark daily. He teaches the software to data scientists, engineers, and data-savvy business analysts.

Erscheinungsdatum
Verlagsort New York
Sprache englisch
Maße 186 x 234 mm
Gewicht 823 g
Themenwelt Mathematik / Informatik Informatik Datenbanken
Mathematik / Informatik Informatik Software Entwicklung
Mathematik / Informatik Informatik Theorie / Studium
Mathematik / Informatik Informatik Web / Internet
ISBN-10 1-61729-720-8 / 1617297208
ISBN-13 978-1-61729-720-5 / 9781617297205
Zustand Neuware
Informationen gemäß Produktsicherheitsverordnung (GPSR)
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Einführung in die Praxis der Datenbankentwicklung für Ausbildung, …

von René Steiner

Buch | Softcover (2021)
Springer Fachmedien Wiesbaden GmbH (Verlag)
49,99
Der Grundkurs für Ausbildung und Praxis

von Ralf Adams

Buch (2023)
Carl Hanser (Verlag)
29,99