Information-Driven Machine Learning
Springer International Publishing (Verlag)
978-3-031-39479-9 (ISBN)
- Noch nicht erschienen - erscheint am 23.12.2024
- Versandkostenfrei innerhalb Deutschlands
- Auch auf Rechnung
- Verfügbarkeit in der Filiale vor Ort prüfen
- Artikel merken
This groundbreaking book transcends traditional machine learning approaches by introducing information measurement methodologies that revolutionize the field.
Stemming from a UC Berkeley seminar on experimental design for machine learning tasks, these techniques aim to overcome the 'black box' approach of machine learning by reducing conjectures such as magic numbers (hyper-parameters) or model-type bias. Information-based machine learning enables data quality measurements, a priori task complexity estimations, and reproducible design of data science experiments. The benefits include significant size reduction, increased explainability, and enhanced resilience of models, all contributing to advancing the discipline's robustness and credibility.
While bridging the gap between machine learning and disciplines such as physics, information theory, and computer engineering, this textbook maintains an accessible and comprehensive style, making complex topics digestible fora broad readership. Information-Driven Machine Learning explores the synergistic harmony among these disciplines to enhance our understanding of data science modeling. Instead of solely focusing on the "how," this text provides answers to the "why" questions that permeate the field, shedding light on the underlying principles of machine learning processes and their practical implications. By advocating for systematic methodologies grounded in fundamental principles, this book challenges industry practices that have often evolved from ideologic or profit-driven motivations. It addresses a range of topics, including deep learning, data drift, and MLOps, using fundamental principles such as entropy, capacity, and high dimensionality.
Ideal for both academia and industry professionals, this textbook serves as a valuable tool for those seeking to deepen their understanding of data science as an engineering discipline. Its thought-provoking content stimulates intellectual curiosity and caters to readers who desire more than just code or ready-made formulas. The text invites readers to explore beyond conventional viewpoints, offering an alternative perspective that promotes a big-picture view for integrating theory with practice. Suitable for upper undergraduate or graduate-level courses, this book can also benefit practicing engineers and scientists in various disciplines by enhancing their understanding of modeling and improving data measurement effectively.
Gerald Friedland: Listed in the AI2000 Most Influential Scholar list as one of the top-cited research scholars in AI in the last decade, Friedland's contributions to the field of machine learning have been both substantial and enduring since he started working in the field in 2001. His Simple Interactive Object Extraction algorithm has been part of open source image editing and creation tools since 2005 and his cloud-less MOVI Speech Recognition board has been used by makers since 2015. Currently, he is adjunct faculty at the University of California, Berkeley, a Faculty Fellow of the Berkeley Institute of Data Science, and a Principal Scientist in the Sagemaker team at Amazon AWS.
After earning his Ph.D. from Freie Universität Berlin in 2006, Gerald led a team of researchers in speech and multimedia content analysis as the Director of Audio and Multimedia research at the International Computer Science Institute in Berkeley. He then held the role of Principal Data Scientist at Lawrence Livermore National Lab from 2016 to 2019. That year, he co-founded Brainome, Inc., where he harnessed his technical expertise to develop an automatic machine learning tool rooted in the information measurement techniques central to this book. His journey then took him to Amazon AWS in 2022 as a Principal Scientist, AutoML.
Beyond his industry and academic roles, Gerald is a seasoned author. His literature contributions span from the textbooks Multimedia Computing (Cambridge University Press) and Multimodal Location Estimation of Videos and Images (Springer) to a programming book for young children published by Apress.
Preface.- 1 Introduction.- 2 The Automated Scientific Process.- 3 The (Black Box) Machine Learning Process.- 4 Information Theory.- 5 Capacity.- 6 The Mechanics of Generalization.- 7 Meta-Math: Exploring the Limits of Modeling.- 8 Capacity of Neural Networks.- 8 Capacity of Neural Networks.- 10 Capacities of some other Machine Learning Methods.- 11 Data Collection and Preparation.- 12 Measuring Data Sufficiency.- 13 Machine Learning Operations.- 14 Explainability.- 15 Repeatability and Reproducibility.- 16 The Curse of Training and the Blessing of High Dimensionality.- 16 The Curse of Training and the Blessing of High Dimensionality.- Appendix A Recap: The Logarithm.- Appendix B More on Complexity.- Appendix C Concepts Cheat Sheet.- Appendix D A Review Form that Promotes Reproducibility.- List of Illustrations.- Bibliography.
Erscheinungsdatum | 03.12.2024 |
---|---|
Zusatzinfo | XXII, 267 p. 50 illus., 33 illus. in color. |
Verlagsort | Cham |
Sprache | englisch |
Maße | 155 x 235 mm |
Themenwelt | Mathematik / Informatik ► Informatik ► Datenbanken |
Schlagworte | decision trees • explainability • information measurements • Information Theory • machine learning experiments • Neural networks |
ISBN-10 | 3-031-39479-8 / 3031394798 |
ISBN-13 | 978-3-031-39479-9 / 9783031394799 |
Zustand | Neuware |
Informationen gemäß Produktsicherheitsverordnung (GPSR) | |
Haben Sie eine Frage zum Produkt? |
aus dem Bereich