Automate a routine analysis, example of Principal Component Analysis, in XLSTAT

Dataset for Principal Component Analysis (PCA) ARCHIVE76.4 KB

Tutorial video
Principal Component Analysis (PCA) is part of: Download Trial version More details See users' feedback
  • Pro Core statistical software

  • System configuration

    • Windows:
      • Versions: 9x/Me/NT/2000/XP/Vista/Win 7
      • Excel: 97 and later
      • Processor: 32 or 64 bits
      • Hard disk: 150 Mb
    • Mac OS X:
      • OS: OS X
      • Excel: X, 2004 and 2011
      • Hard disk: 150Mb.

Benefits

  • Easy and user-friendly
    Easy and user-friendly XLSTAT is flawlessly integrated with Microsoft Excel which is the most popular spreadsheet worldwide. This integration makes it one of the simplest available tools to work with as it utilizes the same philosophy as Microsoft Excel. The program is accessible in a dedicated XLSTAT tab. The analyses are grouped into functional menus. The dialog boxes are user-friendly and setting up an analysis is straightforward.
  • Data and results shared seamlessly
    Data and results shared seamlessly One of the greatest advantages of XLSTAT is the way you can share data and results seamlessly. As the results are stored in Microsoft Excel, anyone can access them. There is no need for the receiver to have an XLSTAT license or any additional viewer which makes your team-work easier and more affordable. In addition, results are easily integrable into other Microsoft Office software such as PowerPoint, so that you can create striking presentation in minutes.
  • Modular
    Modular XLSTAT is a modular product. XLSTAT-Pro is a core statistical module of XLSTAT which includes all the mainstream functionalities in statistics and multivariate analysis. More advanced features contained in add-on modules can be added for specific applications. This way you can adapt the software to your needs making the software more cost-efficient.
  • Didactic
    Didactic The results of XLSTAT are organized by analysis and are easy to navigate. Moreover useful information is provided along with the results to assist you in your interpretation.
  • Affordable
    Affordable XLSTAT is a complete and modular analytical solution that can suit any analytical business needs. It is very reasonably priced so that the return of your investment is almost immediate. Any XLSTAT license comes with top level support and assistance.
  • Accessible - Available in many languages
    Accessible - Available in many languages We have ensured XLSTAT is accessible to everyone by making the program available in many languages, including Chinese, English, French, German, Italian, Japanese, Polish, Portuguese and Spanish.
  • Automatable and customizable
    Automatable and customizable Most of the statistical functions available in XLSTAT can be called directly from the Visual Basic window of Microsoft Excel. They can be modified and integrated to more code to fit to the specificity of your domain. Adding tables and plots as well as modifying existing outputs becomes easy. Furthermore, XLSTAT includes some special tools on the dialog boxes to generate automatically the VBA code in order to reproduce your analysis using the VBA editor or to simply load pre-set settings. This effortless automation of routine analysis will be a huge time saver on your part.

Dataset for automating a routine analysis

Two Excel workbooks with both the data and the results can be downloaded by clicking here.

The data used is the process measurements of food samples.

Creating the VBA codes to be reused

We are going to create a Principal component analysis template on one dataset and use it on the second.

Generating the code to automate a routine analysis

Open the first file Automation_1.xls

Once XLSTAT-Pro is activated, go to the menu Options and in the tab Advanced enable the option Show the advanced buttons in the dialog boxes.

Principal Component Analysis

The next step of the automation procedure is to set up your statistical analysis.

Select the XLSTAT / Analyzing data / Principal components analysis command, or click on the corresponding button of the Analyzing Data toolbar (see below).

automate a Principal Component AnalysisPrincipal Component Analysis software

In the General tab, set the following:

  • Observations/variables table: Columns B to G
  • Data format: Observations/variables table
  • PCA type: Pearson (n)
  • Variable labels: enabled
  • Observation labels: ticked and select the column A for the sample name
  • Sheet: chosen to display the results in a new sheet

automation software

Go to the next tab Options. For the option Filter factors, choose Maximum number and set the value to six. This way all the components will be calculated.

automate an analysis in Excel

Go to the tab Outputs. Here we want to get a synthetic report so we will only select the following:

  • Eigenvalues,
  • Factor Loadings,
  • Variables/Factors correlations,
  • Factor scores.

Principal Component Analysis package

Finally we are going to use all three plots that can be selected in the Charts tab:

  • Correlation charts
  • Observations charts
  • Biplots

Principal Component Analysis tools

Now we have specified all the setting we will save the code to be reused.

Generate the VBA code to be reused

Click on the grey button at the bottom left of the dialog box: Click this button to generate the VBA code that will allow you to run the dialog box from your code.

Principal Component Analysis with MS Excel

Once you have pressed the button a Notepad document will appear containing the VBA code. Save the code under a name that is easy for you to remember, for example in this case we use "VBA-PCA-recipe1".

VBA code in XLSTAT

Results of the analysis

Click on OK to launch the analysis.

Now choose the plot for the axes F1 and F2 by clicking Select, then change the selection to Abscissa F3 and Ordinates F4. Once you have completed this click again on Select and then press Done.

automation tutorial

Have a look at the biplot.

Principal Component Analysis software Excel

This process is usually stable so we can expect little variation. You can see that all the samples are centered tidily around the middle of the plot.

Reusing the VBA code

Now open the second file Automation_2.xls Press Alt+F11 together in order to launch the Visual Basic Application. Then select Sheet1 in the folder VBAProject(Automation_2.xls) and finally right click and opt for the action Insert / Module.

automate a routine analysis

The next step is to copy and paste the code contained in the Notepad file into this module.

automation

At this step you can add more codes to enable the programme to perform other actions. Go to the menu Run / Run Macro located in the menu bar.

automation of analysis

Then you need to run first the macro called "RunMeOnce". This will make a link between the file and the XLSTAT project where the code is stored. Select it in the list and click on Run.

VBA automation software

When this has been completed, run the second macro called "MySub". Return to the menu Run / Run Macro and this time select the macro "MySub" before pressing the button Run. This will in turn execute the code in question and you now have a sheet "PCA" containing the results. Now if we look at the biplot of the second analysis we notice that this time one of the samples seems to be further away thant the other samples. Sample 13 may be an outlier.

Principal Component Analysis biplot

Have a look at this video to see a demonstration on how to automate data analysis with XLSTAT software.