Novelty Detection for Multivariate Data Streams with Probabilistic Models
Seiten
2022
Kassel University Press (Verlag)
978-3-7376-1038-4 (ISBN)
Kassel University Press (Verlag)
978-3-7376-1038-4 (ISBN)
The autonomous detection of unexpected changes in data is called novelty detection. Multivariate data streams consisting of measurements from multiple sensors often form the basis to detect such changes. Specific examples of such changes are, for instance, cardiacarrhythmias, power failures, storms or network attacks. Accordingly, changes can affect both a system itself and the environment in which it is embedded. This doctoral thesis investigates methods for online novelty detection in multivariate data streams and presents the CANDIES methodology. A unique feature of this method is the
explicit separation of the input space of a probabilistic model into different regions – High-Density Regions (HDR) and Low-Density Regions (LDR) – with detection techniques specifically designed for each. While other detectors can usually only detect novelties or anomalies in LDR, the CANDIES method can also identify novelties in HDR. It also offers possibilities to handle concept drift and noise in data streams. Another distinctive feature of CANDIES is the notion of novelties as an agglomeration of anomalies that have a certain relation to each other (spatially or temporally). Additionally, the focus of this work is also on the experimental evaluation of novelty detection algorithms in general. For this purpose, a data generator
that can synthesise data streams and novelties is presented, and a new evaluation measure, the FDS, is specifically designed to evaluate novelty detection methods. All methods, algorithms and tools developed and used in this thesis are also publicly and freely available online.
explicit separation of the input space of a probabilistic model into different regions – High-Density Regions (HDR) and Low-Density Regions (LDR) – with detection techniques specifically designed for each. While other detectors can usually only detect novelties or anomalies in LDR, the CANDIES method can also identify novelties in HDR. It also offers possibilities to handle concept drift and noise in data streams. Another distinctive feature of CANDIES is the notion of novelties as an agglomeration of anomalies that have a certain relation to each other (spatially or temporally). Additionally, the focus of this work is also on the experimental evaluation of novelty detection algorithms in general. For this purpose, a data generator
that can synthesise data streams and novelties is presented, and a new evaluation measure, the FDS, is specifically designed to evaluate novelty detection methods. All methods, algorithms and tools developed and used in this thesis are also publicly and freely available online.
Erscheinungsdatum | 31.08.2022 |
---|---|
Reihe/Serie | Intelligent Embedded Systems ; 21 |
Verlagsort | Kassel |
Sprache | englisch |
Maße | 148 x 210 mm |
Themenwelt | Informatik ► Datenbanken ► Data Warehouse / Data Mining |
Informatik ► Theorie / Studium ► Algorithmen | |
Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik | |
Schlagworte | anomalies • Anomaly Detection • concept drift • Data streams • machine learning • novelties • Novelty detection • outliers • probabilistic modeling |
ISBN-10 | 3-7376-1038-X / 373761038X |
ISBN-13 | 978-3-7376-1038-4 / 9783737610384 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
Mehr entdecken
aus dem Bereich
aus dem Bereich
Datenanalyse für Künstliche Intelligenz
Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
74,95 €
Auswertung von Daten mit pandas, NumPy und IPython
Buch | Softcover (2023)
O'Reilly (Verlag)
44,90 €