Exploring Multivariate Data with the Forward Search
Seiten
2004
Springer-Verlag New York Inc.
978-0-387-40852-1 (ISBN)
Springer-Verlag New York Inc.
978-0-387-40852-1 (ISBN)
Why We Wrote This Book This book is about using graphs to explore and model continuous multi variate data. The normal distribution is central to our book because, subject to our exploration of departures, it provides useful models for many sets of data.
Why We Wrote This Book This book is about using graphs to explore and model continuous multi variate data. Such data are often modelled using the multivariate normal distribution and, indeed, there is a literatme of weighty statistical tomes presenting the mathematical theory of this activity. Our book is very dif ferent. Although we use the methods described in these books, we focus on ways of exploring whether the data do indeed have a normal distribution. We emphasize outlier detection, transformations to normality and the de tection of clusters and unsuspected influential subsets. We then quantify the effect of these departures from normality on procedures such as dis crimination and duster analysis. The normal distribution is central to our book because, subject to our exploration of departures, it provides useful models for many sets of data. However, the standard estimates of the parameters, especially the covari ance matrix of the observations, are highly sensitive to the presence of outliers. This is both a blessing and a curse. It is a blessing because, if we estimate the parameters with the outliers excluded, their effect is appre ciable and apparent if we then include them for estimation. It is however a curse because it can be hard to detect which observations are outliers. We use the forward search for this purpose.
Why We Wrote This Book This book is about using graphs to explore and model continuous multi variate data. Such data are often modelled using the multivariate normal distribution and, indeed, there is a literatme of weighty statistical tomes presenting the mathematical theory of this activity. Our book is very dif ferent. Although we use the methods described in these books, we focus on ways of exploring whether the data do indeed have a normal distribution. We emphasize outlier detection, transformations to normality and the de tection of clusters and unsuspected influential subsets. We then quantify the effect of these departures from normality on procedures such as dis crimination and duster analysis. The normal distribution is central to our book because, subject to our exploration of departures, it provides useful models for many sets of data. However, the standard estimates of the parameters, especially the covari ance matrix of the observations, are highly sensitive to the presence of outliers. This is both a blessing and a curse. It is a blessing because, if we estimate the parameters with the outliers excluded, their effect is appre ciable and apparent if we then include them for estimation. It is however a curse because it can be hard to detect which observations are outliers. We use the forward search for this purpose.
1 Examples of Multivariate Data.- 2 Multivariate Data and the Forward Search.- 3 Data from One Multivariate Distribution.- 4 Multivariate Transformations to Normality.- 5 Principal Components Analysis.- 6 Discriminant Analysis.- 7 Cluster Analysis.- 8 Spatial Linear Models.- Appendix: Tables of Data.- Author Index.
Erscheint lt. Verlag | 9.1.2004 |
---|---|
Reihe/Serie | Springer Series in Statistics |
Zusatzinfo | XXIV, 624 p. |
Verlagsort | New York, NY |
Sprache | englisch |
Maße | 155 x 235 mm |
Themenwelt | Informatik ► Datenbanken ► Data Warehouse / Data Mining |
Mathematik / Informatik ► Mathematik ► Wahrscheinlichkeit / Kombinatorik | |
ISBN-10 | 0-387-40852-5 / 0387408525 |
ISBN-13 | 978-0-387-40852-1 / 9780387408521 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
Mehr entdecken
aus dem Bereich
aus dem Bereich
Datenanalyse für Künstliche Intelligenz
Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
74,95 €
Auswertung von Daten mit pandas, NumPy und IPython
Buch | Softcover (2023)
O'Reilly (Verlag)
44,90 €