Finding Ghosts in Your Data - Kevin Feasel

Finding Ghosts in Your Data (eBook)

Anomaly Detection Techniques with Examples in Python

(Autor)

eBook Download: PDF
2022 | 1st ed.
XX, 353 Seiten
Apress (Verlag)
978-1-4842-8870-2 (ISBN)
Systemvoraussetzungen
62,99 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
Discover key information buried in the noise of data by learning a variety of anomaly detection techniques and using the Python programming language to build a robust service for anomaly detection against a variety of data types. The book starts with an overview of what anomalies and outliers are and uses the Gestalt school of psychology to explain just why it is that humans are naturally great at detecting anomalies. From there, you will move into technical definitions of anomalies, moving beyond 'I know it when I see it' to defining things in a way that computers can understand.

The core of the book involves building a robust, deployable anomaly detection service in Python. You will start with a simple anomaly detection service, which will expand over the course of the book to include a variety of valuable anomaly detection techniques, covering descriptive statistics, clustering, and time series scenarios. Finally, you will compare your anomaly detection service head-to-head with a publicly available cloud offering and see how they perform.

The anomaly detection techniques and examples in this book combine psychology, statistics, mathematics, and Python programming in a way that is easily accessible to software developers. They give you an understanding of what anomalies are and why you are naturally a gifted anomaly detector. Then, they help you to translate your human techniques into algorithms that can be used to program computers to automate the process. You'll develop your own anomaly detection service, extend it using a variety of techniques such as including clustering techniques for multivariate analysis and time series techniques for observing data over time, and compare your service head-on against a commercial service.


What You Will Learn
  • Understand the intuition behind anomalies
  • Convert your intuition into technical descriptions of anomalous data
  • Detect anomalies using statistical tools, such as distributions, variance and standard deviation, robust statistics, and interquartile range
  • Apply state-of-the-art anomaly detection techniques in the realms of clustering and time series analysis
  • Work with common Python packages for outlier detection and time series analysis, such as scikit-learn, PyOD, and tslearn
  • Develop a project from the ground up which finds anomalies in data, starting with simple arrays of numeric data and expanding to include multivariate inputs and even time series data

Who This Book Is For

For software developers with at least some familiarity with the Python programming language, and who would like to understand the science and some of the statistics behind anomaly detection techniques. Readers are not required to have any formal knowledge of statistics as the book introduces relevant concepts along the way.


?Kevin Feasel is a Microsoft Data Platform MVP and CTO at Faregame Inc, where he specializes in data analytics with T-SQL and R, forcing Spark clusters to do his bidding, fighting with Kafka, and pulling rabbits out of hats on demand. He is the lead contributor to Curated SQL, president of the Triangle Area SQL Server Users Group, and author of PolyBase Revealed. A resident of Durham, North Carolina, he can be found cycling the trails along the triangle whenever the weather is nice enough.
Discover key information buried in the noise of data by learning a variety of anomaly detection techniques and using the Python programming language to build a robust service for anomaly detection against a variety of data types. The book starts with an overview of what anomalies and outliers are and uses the Gestalt school of psychology to explain just why it is that humans are naturally great at detecting anomalies. From there, you will move into technical definitions of anomalies, moving beyond "e;I know it when I see it"e; to defining things in a way that computers can understand.The core of the book involves building a robust, deployable anomaly detection service in Python. You will start with a simple anomaly detection service, which will expand over the course of the book to include a variety of valuable anomaly detection techniques, covering descriptive statistics, clustering, and time series scenarios. Finally, you will compare your anomaly detectionservice head-to-head with a publicly available cloud offering and see how they perform.The anomaly detection techniques and examples in this book combine psychology, statistics, mathematics, and Python programming in a way that is easily accessible to software developers. They give you an understanding of what anomalies are and why you are naturally a gifted anomaly detector. Then, they help you to translate your human techniques into algorithms that can be used to program computers to automate the process. You'll develop your own anomaly detection service, extend it using a variety of techniques such as including clustering techniques for multivariate analysis and time series techniques for observing data over time, and compare your service head-on against a commercial service.What You Will LearnUnderstand the intuition behind anomaliesConvert your intuition into technical descriptions of anomalous dataDetect anomalies using statistical tools, such as distributions, variance and standard deviation, robust statistics, and interquartile rangeApply state-of-the-art anomaly detection techniques in the realms of clustering and time series analysisWork with common Python packages for outlier detection and time series analysis, such as scikit-learn, PyOD, and tslearnDevelop a project from the ground up which finds anomalies in data, starting with simple arrays of numeric data and expanding to include multivariate inputs and even time series dataWho This Book Is ForFor software developers with at least some familiarity with the Python programming language, and who would like to understand the science and some of the statistics behind anomaly detection techniques. Readers are not required to have any formal knowledge of statistics as the book introduces relevant concepts along the way.
Erscheint lt. Verlag 9.11.2022
Zusatzinfo XX, 353 p. 106 illus.
Sprache englisch
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Mathematik / Informatik Informatik Programmiersprachen / -werkzeuge
Informatik Theorie / Studium Algorithmen
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Mathematik / Informatik Mathematik Statistik
Schlagworte Anomaly Detection • Anomaly Detection as a Service • Anomaly Detection Principles and Algorithms • Anomaly Detection: Techniques and Applications • ARIMA • ARMA • Azure Cognitive Services Anomaly Detector • Changepoint Detection • exponential smoothing • Gestalt • Interquartile range • Mahalanobis Distance • Multivariate Anomaly Detection • Outlier Analysis • Outlier and Anomaly Detection • Python • Robust Statistics • Time Series Anomaly Detection
ISBN-10 1-4842-8870-X / 148428870X
ISBN-13 978-1-4842-8870-2 / 9781484288702
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 11,4 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
A data engineer's guide to building and managing ETL and ELT …

von Dmitry Foshin; Tonya Chernyshova; Dmitry Anoshin …

eBook Download (2024)
Packt Publishing (Verlag)
39,59