PLS_Toolbox Documentation: simca< shuffle simpls >

simca

Purpose

Create soft independent method of class analogy models for classification.

Synopsis

 

model = simca(x,ncomp,options)       %creates simca model on dataset x

model = simca(x,classid,labels)      %models double x with class id

pred  = simca(x,model,options);      %predictions on x with model

options = simca('options');.

Description

The function SIMCA develops a SIMCA model, which is really a collection of PCA models, one for each class of data in the data set and is used for supervised pattern recognition.

SIMCA cross-validates the PCA model of each class using leave-one-out cross-validation if the number of samples in the class is <= 20. If there are more than 20 samples, the data is split into 10 contiguous blocks.

INPUTS:

                         x =   M x N matrix of class “dataset” where class information is extracted from x.class{1,1} and labels from x.label{1,1}, or

                         x =   M x N data matrix of class “double” and

             classid =   M x 1 vector of class identifiers where each element is an integer identifying the class number of the corresponding sample.

                 model =   when making predictions, input model is a SIMCA model structure.

OPIONAL INPUTS:

                 ncomp =   integer, number of PCs to use in each model. This is rarely known a priori. When ncomp=[] {default} the user is querried for number of PCs for each class.

               labels =   a character array with M rows that is used to label samples on Q vs. T2 plots, otherwise the class identifiers are used.

             options =   a structure array discussed below.

OUPUT:

                 model =   model structure array with the following fields:

     modeltype:  'SIMCA',

    datasource:  structure array with information about input data,

          date:   date of creation,

          time:   time of creation,

          info:   additional model information,

   description:   cell array with text description of model,

      submodel:   structure array with each record containing the PCA model of each class (see PCA), and

        detail:  sub-structure with additional model details and results.

                   pred =   is a structure, similar to model, that contains the SIMCA predictions. Additional, or other, fields in pred are:

                    rtsq:   the reduced T2 (T2 divided by it’s 95Found confidence limit line) where each column corresponds to each class in the SIMCA model,

                        rq:   the reduced Q (Q divided by it’s 95Found confidence limit line) where each column corresponds to each class in the SIMCA model,

                nclass:   the predicted class number (class to which the sample was closest when considering T2 and Q combined), and

    submodelpred:   structure array with each record containing the PCA model predictions for each class (see PCA).

Note: Calling simca with no inputs starts the graphical user interface (GUI) for this analysis method.

Options

             options =   a structure array with the following fields:

              display:   [ {'on'} | 'off' ], governs level of display,

                  plots:   ['none' | {'final'} ], governs level of plotting,

   staticplots:   ['no' | {'yes'} ], produce ole-style "static" plots,

          rule:   [{'combined'} | 'final' | 'T2' | 'Q'], decision rule,

  preprocessing:  { [ ] }, a preprocessing structure (see PREPROCESS) that is used to preprocess data in each class.

The default options can be retreived using: options = simca('options');.

Note: with display='off', plots='none', nocomp=(>0 integer) and preprocessing specified that SIMCA can be run without command line interaction.

See Also

cluster, crossval, discrimprob, pca, plsda, plsdaroc, plsdthres


< shuffle simpls >