Data Science with Java - Michael Brzustowicz

Data Science with Java

Buch | Softcover
236 Seiten
2017
O'Reilly Media (Verlag)
978-1-4919-3411-1 (ISBN)
53,85 inkl. MwSt
Data Science is booming thanks to R and Python, but Java brings the robustness, convenience, and ability to scale critical to today's data science applications. With this practical book, Java software engineers looking to add data science skills will take a logical journey through the data science pipeline. Author Michael Brzustowicz explains the basic math theory behind each step of the data science process, as well as how to apply these concepts with Java.

You'll learn the critical roles that data IO, linear algebra, statistics, data operations, learning and prediction, and Hadoop MapReduce play in the process. Throughout this book, you'll find code examples you can use in your applications.



Examine methods for obtaining, cleaning, and arranging data into its purest form
Understand the matrix structure that your data should take
Learn basic concepts for testing the origin and validity of data
Transform your data into stable and usable numerical values
Understand supervised and unsupervised learning algorithms, and methods for evaluating their success
Get up and running with MapReduce, using customized components suitable for data science algorithms

Michael Brzustowicz is a physicist turned data scientist. After a PhD from Indiana University, Michael spent his post doctoral years at Stanford University where he shot high powered Xrays at tiny molecules. Jumping ship from academia, he worked at many startups (including his own) and has been pioneering big data techniques all the way. Michael specializes in building distributed data systems and extracting knowledge from massive data. He spends most of his time writing customized, multithreaded code for statistical modeling and machine learning approaches to everyday big data problems. Michael now teaches Big Data, parttime, at the University of San Francisco.

Erscheinungsdatum
Verlagsort Sebastopol
Sprache englisch
Maße 178 x 234 mm
Gewicht 416 g
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Informatik Programmiersprachen / -werkzeuge Java
Mathematik / Informatik Informatik Theorie / Studium
Mathematik / Informatik Informatik Web / Internet
ISBN-10 1-4919-3411-5 / 1491934115
ISBN-13 978-1-4919-3411-1 / 9781491934111
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Wie bewerten Sie den Artikel?
Bitte geben Sie Ihre Bewertung ein:
Bitte geben Sie Daten ein:
Mehr entdecken
aus dem Bereich
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
44,90
Das umfassende Handbuch

von Wolfram Langer

Buch | Hardcover (2023)
Rheinwerk (Verlag)
49,90