# Principal Component Regression

### System configuration

• Windows:
  • Versions: 9x/Me/NT/2000/XP/Vista/Win 7/Win 8
  • Excel: 97 and later
  • Processor: 32 or 64 bits
  • Hard disk: 150 MB
• Mac OS X:
  • OS: OS X
  • Excel: X, 2004 and 2011
  • Hard disk: 150 MB

## Benefits

• Easy and user-friendly
• Data and results shared seamlessly
• Modular
• Didactic
• Affordable
• Accessible - Available in many languages
• Automatable and customizable

### Principal Component Regression principle

PCR (Principal Component Regression) is a regression method that can be divided into three steps:

1. Run a PCA (Principal Component Analysis) on the table of explanatory variables.
2. Run an Ordinary Least Squares (OLS) regression, also called linear regression, on the selected components.
3. Compute the parameters of the model that correspond to the input variables.
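The three steps above can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not XLSTAT's implementation: the helper name `pcr_fit` is hypothetical, the PCA is run on centered (not standardized) variables, and the components are simply the first `n_components` ones.

```python
import numpy as np

def pcr_fit(X, y, n_components):
    """Hypothetical helper: PCA on X, OLS on the scores, coefficients mapped back to X."""
    x_mean = X.mean(axis=0)
    y_mean = y.mean()
    Xc = X - x_mean

    # Step 1: PCA of the centered explanatory table via SVD.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    V = Vt[:n_components].T          # loadings, shape (p, n_components)
    S = Xc @ V                       # scores, shape (n, n_components)

    # Step 2: OLS regression of the centered response on the scores.
    gamma, *_ = np.linalg.lstsq(S, y - y_mean, rcond=None)

    # Step 3: express the model in terms of the input variables.
    beta = V @ gamma
    intercept = y_mean - x_mean @ beta
    return intercept, beta
```

When all components are kept and X has full column rank, the returned coefficients coincide with those of an ordinary linear regression on X; keeping fewer components is what distinguishes PCR from classical regression.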

### Principal Component Regression models

PCA transforms an X table with n observations described by p variables into an S table with n scores described by q components, where q is lower than or equal to p and such that (S’S) is invertible. An additional selection can be applied to the components so that only the r components most correlated with the Y variable are kept for the OLS regression step. The result is the R table.
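The selection of the r components most correlated with Y could look like the sketch below. The function name `select_components` is hypothetical, and ranking by absolute Pearson correlation is an assumption; the text does not specify the exact criterion XLSTAT applies.

```python
import numpy as np

def select_components(S, y, r):
    """Keep the r columns of the score table S most correlated (in absolute value) with y."""
    Sc = S - S.mean(axis=0)
    yc = y - y.mean()
    # Pearson correlation between each component and y.
    corr = (Sc.T @ yc) / (np.linalg.norm(Sc, axis=0) * np.linalg.norm(yc))
    # Indices of the r largest absolute correlations, returned in column order.
    idx = np.sort(np.argsort(-np.abs(corr))[:r])
    R = S[:, idx]                    # the reduced table used for the OLS step
    return R, idx
```

Note that the retained components need not be the ones with the largest variance: a low-variance component can still be strongly related to Y.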

The OLS regression is performed on the Y and R tables. Because the parameters obtained from this regression apply to the components rather than to the input variables, they are hard to interpret directly. XLSTAT therefore transforms the results back into the initial space to obtain the parameters and the confidence intervals that correspond to the input variables.
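One way to carry out such a back-transformation is to multiply the component coefficients by the loadings and to propagate their covariance matrix in the same way. The sketch below is an assumption about how this can be done, not a description of XLSTAT's internals; `back_transform` and its arguments are hypothetical names, and the scores are assumed to come from centered variables with an intercept in the model.

```python
import numpy as np

def back_transform(V, gamma, S, residuals):
    """Map coefficients on components back to the input variables.

    V: (p, r) loadings, gamma: (r,) coefficients on the components,
    S: (n, r) centered scores, residuals: (n,) residuals of the OLS step.
    """
    n, r = S.shape
    # Residual variance; one extra degree of freedom is used by the intercept.
    sigma2 = residuals @ residuals / (n - r - 1)
    cov_gamma = sigma2 * np.linalg.inv(S.T @ S)   # requires (S'S) to be invertible
    beta = V @ gamma                              # coefficients on input variables
    cov_beta = V @ cov_gamma @ V.T                # propagated covariance
    se_beta = np.sqrt(np.diag(cov_beta))          # standard errors
    return beta, se_beta
```

A confidence interval for each input variable would then be `beta` plus or minus a Student t quantile (with n - r - 1 degrees of freedom) times `se_beta`.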

### PCR results: Correlation and observations charts and biplots

Because PCR is built on PCA, a great advantage of PCR over classical regression is the set of charts available to describe the data structure. The correlation and loading plots make it easy to study the relationships among the variables, both among the explanatory variables and between the explanatory and dependent variables. The score plot gives information about sample proximity and dataset structure. The biplot gathers all this information in one chart.
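The coordinates shown on a correlation plot are the correlations between each variable and the retained components. A minimal sketch of that computation follows; the function name is hypothetical and a PCA on standardized variables is assumed, which may differ from the options chosen in XLSTAT.

```python
import numpy as np

def correlations_with_components(X, y, n_components=2):
    """Correlation of each explanatory variable, and of y, with the first components."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)      # standardized variables
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    S = Z @ Vt[:n_components].T                   # scores on the kept components

    def corr(a, B):
        ac = a - a.mean()
        Bc = B - B.mean(axis=0)
        return (ac @ Bc) / (np.linalg.norm(ac) * np.linalg.norm(Bc, axis=0))

    var_coords = np.array([corr(X[:, j], S) for j in range(X.shape[1])])
    y_coords = corr(y, S)                         # dependent variable on the same axes
    return var_coords, y_coords
```

Each row of `var_coords` is a point inside the unit circle when two components are kept; variables plotted close to the dependent variable are positively correlated with it on those components.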

### Prediction with Principal Component Regression

Principal Component Regression is also used to build predictive models. XLSTAT enables you to predict the values of new samples.
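Predicting new samples amounts to projecting them onto the components found on the training data and applying the fitted regression. The sketch below assumes a PCA on centered variables; `pcr_predict` and its arguments (the training means, loadings and component coefficients) are hypothetical names introduced for illustration.

```python
import numpy as np

def pcr_predict(X_new, x_mean, V, gamma, y_mean):
    """Predict responses for new samples from a fitted PCR model.

    x_mean, y_mean: training means; V: (p, r) loadings; gamma: (r,) coefficients.
    """
    # Center with the training means, then project onto the training components.
    S_new = (np.atleast_2d(X_new) - x_mean) @ V
    return y_mean + S_new @ gamma
```

The same predictions are obtained from the parameters expressed on the input variables, since `S_new @ gamma` equals the centered new samples multiplied by `V @ gamma`.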