What is it about?
Clustering is an unsupervised method where the number of clusters is not known by users. Therefore, the outcomes of a clustering algorithm depend on the input number of clusters specified by users. Consequently it is very important to evaluate the result of the clustering algorithms according to the number of clusters and choose the one that optimize a certain criterion. We present in this paper several clustering validity indices used in the literature. Using several synthetic and real datasets, these indices are then compared based on clustering results provided by the well known k-means clustering algorithm and its non-linear version the kernel K-means algorithm. The results showed that none of the validity indices is superior to the others; in the other hand, the kernel k-means failed to improve clustering accuracy of the dataset from the number of clusters perspective.
Featured Image
Why is it important?
it investigate the results given by kernal kmeans and k-means using different validity indices
Perspectives
is there unique method for all kinds of data?
Alissar Nasser
Lebanese university
Read the Original
This page is a summary of: Investigating K-means and Kernel K-means Algorithms with Internal Validity Indices for Cluster Identification, Journal of Advances in Mathematics and Computer Science, January 2019, Sciencedomain International,
DOI: 10.9734/jamcs/2019/45837.
You can read the full text:
Resources
Contributors
The following have contributed to this page







