Investigating K-means and Kernel K-means Algorithms with Internal Validity Indices for Cluster Identification

Alissar Nasser

doi:10.9734/jamcs/2019/45837

What is it about?

Clustering is an unsupervised method where the number of clusters is not known by users. Therefore, the outcomes of a clustering algorithm depend on the input number of clusters specified by users. Consequently it is very important to evaluate the result of the clustering algorithms according to the number of clusters and choose the one that optimize a certain criterion. We present in this paper several clustering validity indices used in the literature. Using several synthetic and real datasets, these indices are then compared based on clustering results provided by the well known k-means clustering algorithm and its non-linear version the kernel K-means algorithm. The results showed that none of the validity indices is superior to the others; in the other hand, the kernel k-means failed to improve clustering accuracy of the dataset from the number of clusters perspective.

Why is it important?

it investigate the results given by kernal kmeans and k-means using different validity indices

Perspectives

is there unique method for all kinds of data?
Alissar Nasser
Lebanese university

This page is a summary of: Investigating K-means and Kernel K-means Algorithms with Internal Validity Indices for Cluster Identification, Journal of Advances in Mathematics and Computer Science, January 2019, Sciencedomain International,
DOI: 10.9734/jamcs/2019/45837.
You can read the full text:

Read

Resources

Open Access version
Investigating K-means and Kernel K-means Algorithms with Internal Validity Indices for Cluster Identification
Original-research-article

Contributors

The following have contributed to this page

Alissar Nasser
Lebanese university

Investigating Internal Validity Indices for Cluster Identification

What is it about?

Why is it important?

Perspectives

Resources