Github repo for my in-progress book, "Visualizing Multivariate Data and Models in R" to be published by Taylor & Francis (CRC Press), 2026

friendly/Vis-MLM-book

Visualizing Multivariate Data and Models in R

This is the main repository for my book, Visualizing Multivariate Data and Models in R, to be published by Chapman & Hall/CRC Press.

This book is about graphical methods for multivariate data and their use in understanding relationships, particularly when several aspects must be considered together in multiple-response models such as multivariate analysis of variance and multivariate multiple regression.

Features

Some key substantive features of the book are:

  • Statistical data visualization is cast in a general framework organized by goal (see the data, visualize a model, diagnose problems), rather than by type of graphic. It is best informed by principles and goals of communication, for example making graphical comparisons easy and ordering factors and variables according to what should be seen (effect ordering).

  • Data visualization is seen as a combination of exposure (plotting the raw data) and summarization (plotting statistical summaries) to highlight what should be noticed. For example, data ellipses and confidence ellipses are widely used as simple, effective summaries of data and fitted model parameters. When the data are complex, the idea of visual thinning can be used to balance the tradeoff between the two.

  • The book exploits the rich connections among statistics, geometry and data visualization. Statistical ideas, particularly for multivariate data, can be more easily understood in terms of geometrical ones that can be seen in diagrams and data displays. Moreover, ideas from one domain can amplify what we can understand from another.

  • These graphical tools can be used to understand or explain a wide variety of statistical concepts, phenomena, and paradoxes such as Simpson's paradox, effects of measurement error, and so forth.

  • The HE ("hypothesis-error") plot framework provides a simple way to understand the results of statistical tests and the relations among response outcomes in the multivariate linear model.

  • Dimension reduction techniques such as PCA and discriminant analysis are presented as "multivariate juicers," able to squeeze the important information in high-dimensional data into informative two-dimensional views.
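
As a small illustration of the data ellipse mentioned above, here is a minimal sketch in base R. The choice of the built-in iris data, the two variables, and the 68% coverage level are assumptions made for this example only; they are not taken from the book.

```r
# Sketch: a data ellipse drawn with base R only.
# Assumptions for illustration: iris data, two chosen variables, 68% level.
x <- as.matrix(iris[, c("Sepal.Length", "Petal.Length")])

center <- colMeans(x)   # centroid of the data
S      <- cov(x)        # sample covariance matrix
level  <- 0.68          # nominal coverage under bivariate normality

# Radius of the ellipse in Mahalanobis-distance units
radius <- sqrt(qchisq(level, df = 2))

# Points on the unit circle, mapped through the Cholesky factor of S
# so that every boundary point has squared Mahalanobis distance radius^2
theta   <- seq(0, 2 * pi, length.out = 200)
circle  <- cbind(cos(theta), sin(theta))
ellipse <- sweep(radius * circle %*% chol(S), 2, center, "+")

# Exposure (raw points) plus summarization (ellipse and centroid)
plot(x, pch = 16, col = "grey50",
     xlab = "Sepal length", ylab = "Petal length")
lines(ellipse, lwd = 2)
points(center[1], center[2], pch = 3, cex = 2)
```

The ellipse summarizes the means, standard deviations, and correlation of the two variables in a single shape, which is why it works well as a visual summary laid over the raw points.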

R packages

The book brings together a collection of novel techniques I and others have developed over the past 15 years and implemented in mature R packages. The principal multivariate analysis packages highlighted here are:

  • Hypothesis-Error plots: heplots,
  • Canonical discriminant analysis: candisc,
  • Multivariate influence: mvinfluence,
  • Visualizing collinearity diagnostics: VisCollin,
  • Generalized ridge trace plots for ridge regression: genridge,
  • Matrix linear algebra: matlib.

See my GitHub packages page for more details and other packages.

Contributing

I welcome contributions. If you have a suggestion or a bug report, please post it as an issue on GitHub.

The online version is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

The work is written under a contributor code of conduct. By participating in this project you agree to abide by its terms and the terms of the license.
