Notice: the WebSM website has not been updated since the beginning of 2018.

Web Survey Bibliography

Title Clustering of Categorical Data by Assigning Rank through Statistical Approach
Year 2012
Access date 22.05.2015
Full text

pdf (456 KB)


Most of the earlier work on clustering has mainly been focused on numerical data whose inherent geometric properties can be exploited to naturally define distance functions between data points. Working only on numeric values prohibits it from being used to cluster real world data containing categorical values. Recently, the problem of clustering categorical data has started drawing interest. The k-means algorithm is well known for its efficiency in this respect. It is also well known for its efficiency in clustering large data sets. However, in this paper we use the k-means algorithm to categorical domains by assigning rank value to the attributes

Year of publication2012
Bibliographic typeJournal article