Extension of partitional clustering methods for handling mixed data

Y. Naija, Salem Chakhar, K. Blibech, R. Robbana

    Research output: Contribution to conferencePaperpeer-review

    Abstract

    Clustering is an active research topic in data mining and different methods have been proposed in the literature. Most of these methods are based on the use of a distance measure defined either on numerical attributes or on categorical attributes. However, in fields such as road traffic and medicine, datasets are composed of numerical and categorical attributes. Recently, there have been several proposals to develop clustering methods that support mixed attributes. There are three basic categories of clustering methods: partitional methods, hierarchical methods and density-based methods. This paper proposes an extension of partitional clustering methods devoted to mixed attributes. The proposed extension looks to create several partitions by using numerical attributes-based clustering methods and then chooses the one that maximizes a measure—called “homogeneity degree”—of these partitions according to categorical attributes.
    Original languageEnglish
    Publication statusPublished - 2008
    EventIEEE International Conference on Data Mining Workshops - Pisa, Italy
    Duration: 15 Dec 200819 Dec 2008

    Conference

    ConferenceIEEE International Conference on Data Mining Workshops
    Country/TerritoryItaly
    CityPisa
    Period15/12/0819/12/08

    Fingerprint

    Dive into the research topics of 'Extension of partitional clustering methods for handling mixed data'. Together they form a unique fingerprint.

    Cite this