Exploring Multivariate Data with the Forward Search (e-book) by Cerioli, Andrea
Cerioli, Andrea (author)

Exploring Multivariate Data with the Forward Search e-book

875,33 DKK (incl. VAT 1094,16 DKK)
E-book 875,33 DKK
Authors Cerioli, Andrea (author)
Publisher Springer
Published 17 April 2013
Genres Probability and statistics
Language English
Format pdf
Protection LCP
ISBN 9780387218403
Why We Wrote This Book

This book is about using graphs to explore and model continuous multivariate data. Such data are often modelled using the multivariate normal distribution and, indeed, there is a literature of weighty statistical tomes presenting the mathematical theory of this activity. Our book is very different. Although we use the methods described in these books, we focus on ways of exploring whether the data do indeed have a normal distribution. We emphasize outlier detection, transformations to normality and the detection of clusters and unsuspected influential subsets. We then quantify the effect of these departures from normality on procedures such as discrimination and cluster analysis.

The normal distribution is central to our book because, subject to our exploration of departures, it provides useful models for many sets of data. However, the standard estimates of the parameters, especially the covariance matrix of the observations, are highly sensitive to the presence of outliers. This is both a blessing and a curse. It is a blessing because, if we estimate the parameters with the outliers excluded, their effect is appreciable and apparent if we then include them for estimation. It is however a curse because it can be hard to detect which observations are outliers. We use the forward search for this purpose.