
Principal components analysis (PCA) is a standard tool in multivariate data analysis for reducing the number of dimensions while retaining as much of the data's variation as possible. Instead of investigating thousands of original variables, one explores the first few components, which contain the majority of the data's variation. Visualization and statistical analysis of these new variables, the principal components, can help to find similarities and differences between samples. Important original variables that are the major contributors to the first few components can be discovered as well. This chapter seeks to deliver a conceptual understanding of PCA as well as a mathematical description. We describe how PCA can be used to analyze different datasets, and we include practical code examples. Possible shortcomings of the methodology and ways to overcome these problems are also discussed.
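To make the idea in the abstract concrete, the following is a minimal sketch of PCA in Python using only the standard library. It is not taken from the chapter: the toy data, the function names (`center`, `covariance`, `power_iteration`, `pca`), and the use of power iteration with deflation to find the leading eigenvectors of the covariance matrix are all assumptions made for illustration. Library routines typically rely on a singular value decomposition instead.

```python
import math
import random


def center(rows):
    """Subtract each column's mean so every variable has mean zero."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    return [[r[j] - means[j] for j in range(p)] for r in rows]


def covariance(centered):
    """Sample covariance matrix (p x p) of already-centered data."""
    n, p = len(centered), len(centered[0])
    return [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(p)]
            for i in range(p)]


def mat_vec(mat, v):
    """Multiply a square matrix by a vector."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in mat]


def power_iteration(mat, iters=500, seed=0):
    """Approximate the dominant eigenpair of a symmetric matrix."""
    rng = random.Random(seed)
    v = [rng.random() + 0.1 for _ in mat]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iters):
        w = mat_vec(mat, v)
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:  # matrix is all zeros: no variation left
            break
        v = [x / norm for x in w]
    # Rayleigh quotient gives the eigenvalue for the unit vector v.
    eigenvalue = sum(a * b for a, b in zip(v, mat_vec(mat, v)))
    return eigenvalue, v


def pca(rows, n_components=2):
    """Return sample scores, (variance, loadings) pairs, explained ratios."""
    x = center(rows)
    cov = covariance(x)
    p = len(cov)
    total_var = sum(cov[i][i] for i in range(p))
    components = []
    for _ in range(n_components):
        lam, v = power_iteration(cov)
        components.append((lam, v))
        # Deflation: remove the variance already captured by this component.
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(p)]
               for i in range(p)]
    # Project each centered sample onto the component directions.
    scores = [[sum(a * b for a, b in zip(r, v)) for _, v in components]
              for r in x]
    explained = [lam / total_var for lam, _ in components]
    return scores, components, explained


# Hypothetical toy data: 5 samples (rows) x 3 variables (columns).
# Variables 0 and 1 rise together; variable 2 is nearly constant.
data = [
    [2.0, 4.1, 0.5],
    [3.0, 6.0, 0.4],
    [4.0, 7.9, 0.6],
    [5.0, 10.2, 0.5],
    [6.0, 11.8, 0.4],
]

scores, components, explained = pca(data, n_components=2)
print("explained variance ratios:", explained)
print("loadings of the first component:", components[0][1])
print("sample scores:", scores)
```

In this sketch, the explained variance ratios indicate how much of the total variation each component retains, the scores are the new coordinates of the samples that one would plot to look for similarities and differences, and the loadings with the largest absolute values point to the original variables that contribute most to a component.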


Detlef Groth, Stefanie Hartmann, Sebastian Klie, Joachim Selbig. Principal components analysis. Methods in molecular biology (Clifton, N.J.). 2013;930:527-47


PMID: 23086856
