Big Chemical Encyclopedia

Chemical substances, components, reactions, process design ...

Articles Figures Tables About

Centroid-based clustering method

Points with a constant Euclidean distance from a reference point (like the center) are located on a hypersphere (in two dimensions on a circle) points with a constant Mahalanobis distance to the center are located on a hyperellipsoid (in two dimensions on an ellipse) that envelops the cluster of object points (Figure 2.11). That means the Mahalanobis distance depends on the direction. Mahalanobis distances are used in classification methods, by measuring the distances of an unknown object to prototypes (centers, centroids) of object classes (Chapter 5). Problematic with the Mahalanobis distance is the need of the inverse of the covariance matrix which cannot be calculated with highly correlating variables. A similar approach without this drawback is the classification method SIMCA based on PC A (Section 5.3.1, Brereton 2006 Eriksson et al. 2006). [Pg.60]

To generate the dendrogram, HCA methods form clusters of samples based on their nearness in row space. A common approach is to initially treat every sample as a cluster and join closest clusters together. This process is repeated until only one cluster remains. Variations of HCA use different approaches to measure distances between clusters (e.g., single vs. centroid linking, Euclidean vs. Mahalanobis distance), fhe two methods discussed below use single and centroid linking with Euclidean distances. [Pg.216]

In this work, classification through HCA was based on the Euclidean distance and the average group method. This method established links between samples/cluster. The distance between two clusters was computed as the distance between the average values (the mean vector or centroids) of the two clusters. The descriptors employed in HCA were the same selected in... [Pg.193]

A more general model would be a hyperellipsoid as defined by the Mahalanobis distance. SIMCA uses one or several principal components as the axes of the model. If the shape of a group of objects considered is like a cigar then the first principal component would be sufficient to define a suitable model (Figure 8). If the cluster shape is similar to a brick then the second or even the third principal component are also required. If the cluster shape is similar to a ball then only the centroid is necessary for the model. Because the method is based on principal component analysis it is not very sensitive to a large numbo- of features or to collinear features. [Pg.356]


See other pages where Centroid-based clustering method is mentioned: [Pg.407]    [Pg.19]    [Pg.21]    [Pg.32]    [Pg.501]    [Pg.81]    [Pg.164]    [Pg.14]    [Pg.20]    [Pg.22]    [Pg.33]    [Pg.501]    [Pg.31]    [Pg.75]    [Pg.71]    [Pg.78]    [Pg.75]    [Pg.339]    [Pg.278]    [Pg.278]    [Pg.124]    [Pg.373]    [Pg.311]    [Pg.159]    [Pg.15]    [Pg.302]    [Pg.179]    [Pg.16]    [Pg.703]    [Pg.166]   
See also in sourсe #XX -- [ Pg.15 ]




SEARCH



Centroid

Centroid methods

Cluster method

Clustering) centroid

Method clustering

© 2024 chempedia.info