Exploring Multivariate Data with the Forward Search

Anthony C. Atkinson, Marco Riani, Andrea Cerioli (Autoren)

Buch | Hardcover

624 Seiten

2004
Springer-Verlag New York Inc.
978-0-387-40852-1 (ISBN)

Artikel merken

Why We Wrote This Book This book is about using graphs to explore and model continuous multi variate data. The normal distribution is central to our book because, subject to our exploration of departures, it provides useful models for many sets of data.

Why We Wrote This Book This book is about using graphs to explore and model continuous multi variate data. Such data are often modelled using the multivariate normal distribution and, indeed, there is a literatme of weighty statistical tomes presenting the mathematical theory of this activity. Our book is very dif ferent. Although we use the methods described in these books, we focus on ways of exploring whether the data do indeed have a normal distribution. We emphasize outlier detection, transformations to normality and the de tection of clusters and unsuspected influential subsets. We then quantify the effect of these departures from normality on procedures such as dis crimination and duster analysis. The normal distribution is central to our book because, subject to our exploration of departures, it provides useful models for many sets of data. However, the standard estimates of the parameters, especially the covari ance matrix of the observations, are highly sensitive to the presence of outliers. This is both a blessing and a curse. It is a blessing because, if we estimate the parameters with the outliers excluded, their effect is appre ciable and apparent if we then include them for estimation. It is however a curse because it can be hard to detect which observations are outliers. We use the forward search for this purpose.

1 Examples of Multivariate Data.- 2 Multivariate Data and the Forward Search.- 3 Data from One Multivariate Distribution.- 4 Multivariate Transformations to Normality.- 5 Principal Components Analysis.- 6 Discriminant Analysis.- 7 Cluster Analysis.- 8 Spatial Linear Models.- Appendix: Tables of Data.- Author Index.

Erscheint lt. Verlag	9.1.2004
Reihe/Serie	Springer Series in Statistics
Zusatzinfo	XXIV, 624 p.
Verlagsort	New York, NY
Sprache	englisch
Maße	155 x 235 mm
Themenwelt	Informatik ► Datenbanken ► Data Warehouse / Data Mining
Themenwelt	Mathematik / Informatik ► Mathematik ► Wahrscheinlichkeit / Kombinatorik
ISBN-10	0-387-40852-5 / 0387408525
ISBN-13	978-0-387-40852-1 / 9780387408521
Zustand	Neuware