Statistics: Multivariate Analysis: Hierarchical Cluster Analysis
Perform hierarchical cluster analysis
This feature is for OriginPro only.
Minimum Origin Version Required:8.6
Please refer to the page for additional option switches when accessing the x-function from script
Display Name |
Variable Name |
I/O and Type |
Default Value |
Description |
---|---|---|---|---|
Variables | irng |
Input Range |
|
Select data range for the hierarchical cluster analysis. Note that beginning with Origin 2020b, there is a shortened syntax that follows the form [Book]Sheet!(N1:N2), N1 = the beginning column index and N2 being the ending column index in a contiguous range of columns. More complex strings from non-contiguous data of the form [Book]Sheet!([Book]Sheet!N1:N2,[Book]Sheet!N3:N4) are also possible. |
Observation Labels | label |
Input Range |
|
Select labels for observations. If labels are chosen, they will be shown as ticks of X axis in the dendrogram. This option is enabled only when obj is Observations. |
Cluster | obj |
Input int |
|
Specify the type of objects to cluster.
Option list:
|
Cluster Method | link |
Input int |
|
Select the linkage method to calculate the distance between a cluster and a new cluster. Values start from 0, but string values (such as near) are recommended for clarity.
Option list:
To learn more about linkage methods, see the algorithm of linkage methods. |
Distance Type | dist1 |
Input int |
|
Select a distance type in the hierarchical cluster analysis when obj is Observations. Values start from 0, but string values (such as euc) are recommended for clarity.
Option list:
|
Distance Type | dist2 |
Input int |
|
Select a distance type in the hierarchical cluster analysis when obj is Variables. Values start from 0, but string values (such as corr) are recommended for clarity.
Option list:
|
Standardize Variables | std |
Input int |
|
Specify the method to standardize variables. It is available only when obj is Observations. Values start from 0, but string values (such as snd) are recommended for clarity.
Option list:
|
Number of Clusters | number |
Input int |
|
Specify the number of clusters. |
Find Clustroid by | stat |
Input int |
Specify the method to find the clustroid: the most/least representative variable/observation.
Option list:
| |
Dissimilarity Matrix | dissimilarity |
Input int |
|
Specify whether to output the distance matrix. For a large number of objects, the distance matrix will be shown in a sheet instead of the report. 1 = Yes, 0 = No. |
Cluster Stages | stage |
Input int |
|
Specify whether to output the cluster stages. 1 = Yes, 0 = No. |
Cluster Center | center |
Input int |
|
Specify whether to calculate cluster centers. It is available only when obj is Observations. 1 = Yes, 0 = No. |
Distance between Cluster Centers | distc2c |
Input int |
|
Specify whether to calculate the distances between cluster centers. It is available only when obj is Observations. 1 = Yes, 0 = No. |
Distance between Observations and Clusters | disto2c |
Input int |
|
Specify whether to calculate the distance between each observation and cluster centers. It is available only when obj is Observations. 1 = Yes, 0 = No. |
Dendrogram | dendrogram |
Input int |
|
Specify whether to show the dendrogram. 1 = Yes, 0 = No. |
Show Dendrogram | ngraph |
Input int |
|
Specify whether to show the dendrogram in a single graph or in separate graphs for clusters. It is enabled only when dendrogram is 1. Values start from 0.
Option list:
|
Orientation | orient |
Input int |
|
Specify the orientation of the dendrogram. Enabled only when dendrogram is Yes.
Option List:
|
Cluster Report | rt |
Output ReportTree |
|
Specify the sheet for the hierarchical cluster analysis report. |
Cluster Membership | rd |
Output ReportData |
|
Specify the sheet for cluster membership and distance between observations and clusters. |
Distance Matrix | rddist |
Output ReportData |
|
Specify the sheet for distance matrix when number of objects to cluster is very large. This variable is hidden in the dialog. |
Plot Data | rdplot |
Output ReportData |
|
Specify the sheet for plot data. This variable is hidden in the dialog. |
Clustroid Info | clustroid |
Input int |
|
Specify the method to find the Clustroid Info: the most/least representative variable/observation |
This function performs the Hierarchical Cluster Analysis on range data. For more information, see the Cluster Analysis.
hcluster irng:=4[1]:15[100] number:=5 rd:=[<input>]<input> -r 2;
See the algorithm of Hierarchical Cluster Analysis.
See the reference of Cluster Analysis.