Exploring Multivariate Data with the Forward Search - Anthony C. Atkinson, Marco Riani, Andrea Cerioli

Exploring Multivariate Data with the Forward Search

Buch | Hardcover
624 Seiten
2004
Springer-Verlag New York Inc.
978-0-387-40852-1 (ISBN)
160,49 inkl. MwSt
Why We Wrote This Book This book is about using graphs to explore and model continuous multi­ variate data. The normal distribution is central to our book because, subject to our exploration of departures, it provides useful models for many sets of data.
Why We Wrote This Book This book is about using graphs to explore and model continuous multi­ variate data. Such data are often modelled using the multivariate normal distribution and, indeed, there is a literatme of weighty statistical tomes presenting the mathematical theory of this activity. Our book is very dif­ ferent. Although we use the methods described in these books, we focus on ways of exploring whether the data do indeed have a normal distribution. We emphasize outlier detection, transformations to normality and the de­ tection of clusters and unsuspected influential subsets. We then quantify the effect of these departures from normality on procedures such as dis­ crimination and duster analysis. The normal distribution is central to our book because, subject to our exploration of departures, it provides useful models for many sets of data. However, the standard estimates of the parameters, especially the covari­ ance matrix of the observations, are highly sensitive to the presence of outliers. This is both a blessing and a curse. It is a blessing because, if we estimate the parameters with the outliers excluded, their effect is appre­ ciable and apparent if we then include them for estimation. It is however a curse because it can be hard to detect which observations are outliers. We use the forward search for this purpose.

1 Examples of Multivariate Data.- 2 Multivariate Data and the Forward Search.- 3 Data from One Multivariate Distribution.- 4 Multivariate Transformations to Normality.- 5 Principal Components Analysis.- 6 Discriminant Analysis.- 7 Cluster Analysis.- 8 Spatial Linear Models.- Appendix: Tables of Data.- Author Index.

Erscheint lt. Verlag 9.1.2004
Reihe/Serie Springer Series in Statistics
Zusatzinfo XXIV, 624 p.
Verlagsort New York, NY
Sprache englisch
Maße 155 x 235 mm
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Mathematik / Informatik Mathematik Wahrscheinlichkeit / Kombinatorik
ISBN-10 0-387-40852-5 / 0387408525
ISBN-13 978-0-387-40852-1 / 9780387408521
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Datenanalyse für Künstliche Intelligenz

von Jürgen Cleve; Uwe Lämmel

Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
74,95
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
44,90