Large-Scale Data Analytics with Python and Spark - Isaac Triguero, Mikel Galar

Large-Scale Data Analytics with Python and Spark

A Hands-on Guide to Implementing Machine Learning Solutions
Buch | Softcover
422 Seiten
2023
Cambridge University Press (Verlag)
978-1-009-31825-9 (ISBN)
37,40 inkl. MwSt
A hands-on textbook teaching how to carry out large-scale data analytics and implement machine learning solutions for big data. Including copious real-world examples, it offers a coherent teaching package with lab assignments, exercises, solutions for instructors, and lecture slides.
Based on the authors' extensive teaching experience, this hands-on graduate-level textbook teaches how to carry out large-scale data analytics and design machine learning solutions for big data. With a focus on fundamentals, this extensively class-tested textbook walks students through key principles and paradigms for working with large-scale data, frameworks for large-scale data analytics (Hadoop, Spark), and explains how to implement machine learning to exploit big data. It is unique in covering the principles that aspiring data scientists need to know, without detail that can overwhelm. Real-world examples, hands-on coding exercises and labs combine with exceptionally clear explanations to maximize student engagement. Well-defined learning objectives, exercises with online solutions for instructors, lecture slides, and an accompanying suite of lab exercises of increasing difficulty in Jupyter Notebooks offer a coherent and convenient teaching package. An ideal teaching resource for courses on large-scale data analytics with machine learning in computer/data science departments.

Isaac Triguero is Distinguished Senior Researcher at the Department of Computer Science and Artificial Intelligence, University of Granada, and Associate Professor of Data Science at the School of Computer Science of the University of Nottingham. He won the 2019 School of Computer Science – University of Nottingham Award for Teaching. Mikel Galar is Associate Professor of Computer Science and Artificial Intelligence at the Department of Statistics, Computer Science and Mathematics, Public University of Navarre. He is a co-founder of Neuraptic AI and won the 2020 Excellence in Teaching Award of the Public University of Navarre.

Part I. Understanding and Dealing with Big Data: 1. Introduction; 2. MapReduce; Part II. Big Data Frameworks: 3. Hadoop; 4. Spark; 5. Spark SQL and DataFrames; Part III. Machine Learning for Big Data: 6. Machine Learning with Spark; 7. Machine Learning for Big Data; 8. Implementing Classical Methods: k-means and Linear Regression; 9. Advanced Examples: Semi-supervised, Ensembles, Deep Learning Model Deployment.

Erscheinungsdatum
Zusatzinfo Worked examples or Exercises
Verlagsort Cambridge
Sprache englisch
Maße 170 x 245 mm
Gewicht 780 g
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
ISBN-10 1-009-31825-X / 100931825X
ISBN-13 978-1-009-31825-9 / 9781009318259
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
44,90
Das umfassende Handbuch

von Wolfram Langer

Buch | Hardcover (2023)
Rheinwerk (Verlag)
49,90