Clustering is an active research topic in data mining and different methods have been proposed in the literature. Most of these methods are based on the use of a distance measure defined either on numerical attributes or on categorical attributes. However, in fields such as road traffic and medicine, datasets are composed of numerical and categorical attributes. Recently, there have been several proposals to develop clustering methods that support mixed attributes. There are three basic categories of clustering methods: partitional methods, hierarchical methods and density-based methods. This paper proposes an extension of partitional clustering methods devoted to mixed attributes. The proposed extension looks to create several partitions by using numerical attributes-based clustering methods and then chooses the one that maximizes a measure—called “homogeneity degree”—of these partitions according to categorical attributes.
|Publication status||Published - 2008|
|Event||IEEE International Conference on Data Mining Workshops - Pisa, Italy|
Duration: 15 Dec 2008 → 19 Dec 2008
|Conference||IEEE International Conference on Data Mining Workshops|
|Period||15/12/08 → 19/12/08|