Parallel Computing for Data Science - Norman Matloff

Parallel Computing for Data Science

With Examples in R, C++ and CUDA

(Autor)

Buch | Softcover
328 Seiten
2020
Chapman & Hall/CRC (Verlag)
978-0-367-73819-8 (ISBN)
57,35 inkl. MwSt
This is one of the first parallel computing books to focus exclusively on parallel data structures, algorithms, software tools, and applications in data science. The book prepares readers to write effective parallel code in various languages and learn more about different R packages and other tools. It covers the classic "n observations, p varia
Parallel Computing for Data Science: With Examples in R, C++ and CUDA is one of the first parallel computing books to concentrate exclusively on parallel data structures, algorithms, software tools, and applications in data science. It includes examples not only from the classic "n observations, p variables" matrix format but also from time series, network graph models, and numerous other structures common in data science. The examples illustrate the range of issues encountered in parallel programming.



With the main focus on computation, the book shows how to compute on three types of platforms: multicore systems, clusters, and graphics processing units (GPUs). It also discusses software packages that span more than one type of hardware and can be used from more than one type of programming language. Readers will find that the foundation established in this book will generalize well to other languages, such as Python and Julia.

Dr. Norman Matloff is a professor of computer science at the University of California, Davis, where he was a founding member of the Department of Statistics. He is a statistical consultant and a former database software developer. He has published numerous articles in prestigious journals, such as the ACM Transactions on Database Systems, ACM Transactions on Modeling and Computer Simulation, Annals of Probability, Biometrika, Communications of the ACM, and IEEE Transactions on Data Engineering. He earned a PhD in pure mathematics from UCLA, specializing in probability/functional analysis and statistics.

Introduction to Parallel Processing in R. "Why Is My Program So Slow?": Obstacles to Speed. Principles of Parallel Loop Scheduling. The Shared Memory Paradigm: A Gentle Introduction through R. The Shared Memory Paradigm: C Level. The Shared Memory Paradigm: GPUs. Thrust and Rth. The Message Passing Paradigm. MapReduce Computation. Parallel Sorting and Merging. Parallel Prefix Scan. Parallel Matrix Operations. Inherently Statistical Approaches: Subset Methods. Appendices.

Erscheinungsdatum
Reihe/Serie Chapman & Hall/CRC The R Series
Sprache englisch
Maße 156 x 234 mm
Gewicht 650 g
Themenwelt Mathematik / Informatik Informatik Theorie / Studium
Informatik Weitere Themen Hardware
Mathematik / Informatik Mathematik
Sozialwissenschaften Soziologie
ISBN-10 0-367-73819-8 / 0367738198
ISBN-13 978-0-367-73819-8 / 9780367738198
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
entwickle, drucke und baue deine DIY-Objekte

von Stephan Regele

Buch | Hardcover (2023)
Hanser, Carl (Verlag)
34,99