cluster Documentation


PLS_Toolbox Documentation: cluster

cluster

Purpose

K-means and K-nearest neighbor cluster analysis with dendrograms.

Synopsis

cluster(data,labels,options)

cluster(data,options)

options = cluster('options')

Description

cluster(data) performs a cluster analysis on data matrix data using K-means or K-nearest neighbor clustering and plots a dendrogram showing distances between the samples. data can be class "double" or "dataset".

Optional input labels can be used to put labels on the dendrogram plots. If data is M by N, labels must be a character array with M rows. When labels is not specified and data is class "double", the dendrogram is plotted using sample numbers. When labels is not specified and data is class "dataset", the dendrogram is plotted using the sample labels stored in the dataset; if the dataset's labels field is empty, sample numbers are used instead.

The output is a dendrogram showing the sample distances.
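
As an illustration of the calling convention described above, the following is a minimal sketch that assumes PLS_Toolbox is on the MATLAB path. The data values and label strings are made up for the example; only the cluster call forms come from the Synopsis.

```matlab
% Hypothetical data: 6 samples (rows) by 4 variables (columns).
data = rand(6,4);

% One label per sample. char() pads the strings so that the result is a
% character array with 6 rows, matching the 6 rows of data.
labels = char('sample A','sample B','sample C', ...
              'sample D','sample E','sample F');

% Start from the default options structure.
options = cluster('options');

% Plot the dendrogram with the labels above instead of sample numbers.
cluster(data,labels,options);
```

With the default options, pca is 'false', so no prompt for the number of PCA factors is expected in this sketch.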

Note: Calling cluster with no inputs starts the graphical user interface (GUI) for this analysis method.

Options

options = a structure array with the following fields:

    name: 'options', name indicating that this is an options structure,

    algorithm: [ {'knn'} | 'kmeans' ] clustering algorithm,

    preprocessing: {[]} preprocessing structure or keyword (see PREPROCESS),

    pca: [ {'false'} | 'true' ] if 'true' then CLUSTER performs PCA first and clustering on the scores,

    ncomp: [] number of PCA factors to use {default = [], the user is prompted to select the number of factors from the SSQ table}, and

    mahalanobis: [ {'false'} | 'true' ] if 'true' then a Mahalanobis distance on the scores is used.

The default options can be retrieved using: options = cluster('options');
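
The fields listed above can be changed on the returned structure before it is passed back to cluster. The following is a minimal sketch, again assuming PLS_Toolbox is on the MATLAB path; the data matrix and the chosen field values are illustrative only.

```matlab
% Hypothetical data: 10 samples (rows) by 5 variables (columns).
data = rand(10,5);

% Retrieve the defaults, then override selected fields.
options = cluster('options');
options.algorithm   = 'kmeans';  % default is 'knn'
options.pca         = 'true';    % cluster on PCA scores rather than raw data
options.ncomp       = 2;         % set explicitly to avoid the SSQ table prompt
options.mahalanobis = 'true';    % Mahalanobis distance on the scores

% data is class "double" and no labels are given, so the dendrogram
% is plotted using sample numbers.
cluster(data,options);
```

Note that pca and mahalanobis take the strings 'true' and 'false' as documented above, not logical values.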