Hands-on Scikit-Learn for Machine Learning Applications -  David Paper

Hands-on Scikit-Learn for Machine Learning Applications (eBook)

Data Science Fundamentals with Python

(Autor)

eBook Download: PDF
2019 | 1st ed.
XIII, 242 Seiten
Apress (Verlag)
978-1-4842-5373-1 (ISBN)
Systemvoraussetzungen
56,99 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
Aspiring data science professionals can learn the Scikit-Learn library along with the fundamentals of machine learning with this book. The book combines the Anaconda Python distribution with the popular Scikit-Learn library to demonstrate a wide range of supervised and unsupervised machine learning algorithms. Care is taken to walk you through the principles of machine learning through clear examples written in Python that you can try out and experiment with at home on your own machine.

All applied math and programming skills required to master the content are covered in this book. In-depth knowledge of object-oriented programming is not required as working and complete examples are provided and explained. Coding examples are in-depth and complex when necessary. They are also concise, accurate, and complete, and complement the machine learning concepts introduced. Working the examples helps to build the skills necessary to understand and apply complex machine learning algorithms.

Hands-on Scikit-Learn for Machine Learning Applications is an excellent starting point for those pursuing a career in machine learning. Students of this book will learn the fundamentals that are a prerequisite to competency. Readers will be exposed to the Anaconda distribution of Python that is designed specifically for data science professionals, and will build skills in the popular Scikit-Learn library that underlies many machine learning applications in the world of Python.


What You'll Learn
  • Work with simple and complex datasets common to Scikit-Learn
  • Manipulate data into vectors and matrices for algorithmic processing
  • Become familiar with the Anaconda distribution used in data science
  • Apply machine learning with Classifiers, Regressors, and Dimensionality Reduction
  • Tune algorithms and find the best algorithms for each dataset
  • Load data from and save to CSV, JSON, Numpy, and Pandas formats

Who This Book Is For

The aspiring data scientist yearning to break into machine learning through mastering the underlying fundamentals that are sometimes skipped over in the rush to be productive. Some knowledge of object-oriented programming and very basic applied linear algebra will make learning easier, although anyone can benefit from this book.



Dr. David Paper is a professor at Utah State University in the Management Information Systems department. He wrote the book Web Programming for Business: PHP Object-Oriented Programming with Oracle and he has over 70 publications in refereed journals such as Organizational Research Methods, Communications of the ACM, Information & Management, Information Resource Management Journal, Communications of the AIS, Journal of Information Technology Case and Application Research, and Long Range Planning. He has also served on several editorial boards in various capacities, including associate editor. Besides growing up in family businesses, Dr. Paper has worked for Texas Instruments, DLS, Inc., and the Phoenix Small Business Administration. He has performed IS consulting work for IBM, AT&T, Octel, Utah Department of Transportation, and the Space Dynamics Laboratory. Dr. Paper's teaching and research interests include data science, process reengineering, object-oriented programming, electronic customer relationship management, change management, e-commerce, and enterprise integration.
Aspiring data science professionals can learn the Scikit-Learn library along with the fundamentals of machine learning with this book. The book combines the Anaconda Python distribution with the popular Scikit-Learn library to demonstrate a wide range of supervised and unsupervised machine learning algorithms. Care is taken to walk you through the principles of machine learning through clear examples written in Python that you can try out and experiment with at home on your own machine.All applied math and programming skills required to master the content are covered in this book. In-depth knowledge of object-oriented programming is not required as working and complete examples are provided and explained. Coding examples are in-depth and complex when necessary. They are also concise, accurate, and complete, and complement the machine learning concepts introduced. Working the examples helps to build the skills necessary to understand and apply complexmachine learning algorithms.Hands-on Scikit-Learn for Machine Learning Applications is an excellent starting point for those pursuing a career in machine learning. Students of this book will learn the fundamentals that are a prerequisite to competency. Readers will be exposed to the Anaconda distribution of Python that is designed specifically for data science professionals, and will build skills in the popular Scikit-Learn library that underlies many machine learning applications in the world of Python.What You'll LearnWork with simple and complex datasets common to Scikit-LearnManipulate data into vectors and matrices for algorithmic processingBecome familiar with the Anaconda distribution used in data scienceApply machine learning with Classifiers, Regressors, and Dimensionality ReductionTune algorithms and find the best algorithms for each datasetLoad data from and save to CSV, JSON, Numpy, and Pandas formatsWho This Book Is ForThe aspiring data scientist yearning to break into machine learning through mastering the underlying fundamentals that are sometimes skipped over in the rush to be productive. Some knowledge of object-oriented programming and very basic applied linear algebra will make learning easier, although anyone can benefit from this book.

Table of Contents 5
About the Author 8
About the Technical Reviewer 9
Introduction 10
Chapter 1: Introduction to Scikit-Learn 11
Machine Learning 11
Anaconda 12
Scikit-Learn 13
Data Sets 13
Characterize Data 14
Simple Classification Data 14
Iris Data 14
Wine Data 17
Bank Data 19
Digits Data 20
Complex Classification Data 24
Newsgroup Data 25
MNIST Data 26
Faces Data 29
Regression Data 31
Tips Data 31
Red and White Wine 33
Boston Data 36
Feature Scaling 37
Dimensionality Reduction 40
Chapter 2: Classification from Simple Training Sets 46
Simple Data Sets 47
Classifying Wine Data 47
Classifying Digits 52
Classifying Bank Data 61
Classifying make_moons 73
Chapter 3: Classification from  Complex Training Sets 79
Complex Data Sets 79
Classifying fetch_20newsgroups 79
Classifying MNIST 87
Training with the Entire MNIST Data Set 87
Training MNIST Sample Data 95
Classifying fetch_lfw_people 103
Chapter 4: Predictive Modeling Through Regression 113
Regression Data Sets 113
Regressing tips 114
Regressing boston 125
Regressing wine data 130
Chapter 5: Scikit-Learn Classifier Tuning from Simple Training Sets 145
Tuning Data Sets 147
Tuning Iris Data 148
Tuning Digits Data 152
Tuning Bank Data 157
Tuning Wine Data 165
Chapter 6: Scikit-Learn Classifier Tuning from Complex Training Sets 172
Tuning Data Sets 173
Tuning fetch_1fw_people 173
Tuning MNIST 182
Tuning fetch_20newsgroups 191
Chapter 7: Scikit-Learn Regression Tuning 196
Tuning Data Sets 197
Tuning tips 197
Tuning boston 206
Tuning wine 215
Chapter 8: Putting It All Together 221
The Journey 221
Value and Cost 222
MNIST Value and Cost 224
Explaining MNIST to Money People 228
Explaining Output to Money People 228
Explaining the Confusion Matrix to Money People 229
Explaining Visualizations to Money People 230
Value and Cost 230
fetch_lfw_people Value and Cost 231
Explaining fetch_lfw_people to Money People 235
Explaining Output to Money People 235
Explaining Visualizations to Money People 236
Value and Cost 236
fetch_20newsgroups Value and Cost 237
Explaining fetch_20newsgroups to Money People 241
Explaining Output to Money People 241
Explaining the Confusion Matrix to Money People 241
Value and Cost 242
Index 244

Erscheint lt. Verlag 16.11.2019
Zusatzinfo XIII, 242 p. 33 illus.
Sprache englisch
Themenwelt Mathematik / Informatik Informatik Datenbanken
Mathematik / Informatik Informatik Programmiersprachen / -werkzeuge
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Schlagworte Algorithmic Processing • Anaconda Distribution • Classifiers • confusion matrix • Data Science • dimensionality reduction • machine learning • NumPy • Pandas Format • Python • Regressors • scikit-learn
ISBN-10 1-4842-5373-6 / 1484253736
ISBN-13 978-1-4842-5373-1 / 9781484253731
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 3,3 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Zusätzliches Feature: Online Lesen
Dieses eBook können Sie zusätzlich zum Download auch online im Webbrowser lesen.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
der Praxis-Guide für Künstliche Intelligenz in Unternehmen - Chancen …

von Thomas R. Köhler; Julia Finkeissen

eBook Download (2024)
Campus Verlag
38,99
Wie du KI richtig nutzt - schreiben, recherchieren, Bilder erstellen, …

von Rainer Hattenhauer

eBook Download (2023)
Rheinwerk Computing (Verlag)
24,90