Cluster ensemble selection and consensus clustering: a multi-objective optimization approach

Dilay Aktaş, Banu Lokman*, Tülin İnkaya, Gilles Dejaegere

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Cluster ensembles have emerged as a powerful tool to obtain clusters of data points by combining a library of clustering solutions into a consensus solution. In this paper, we address the cluster ensemble selection problem and design a multi-objective optimization-based solution framework to produce consensus solutions. Given a library of clustering solutions, we first design a preprocessing procedure that measures the agreement of each clustering solution with the other solutions and eliminates the ones that may mislead the process. We then develop a multi-objective optimization algorithm that selects representative clustering solutions from the preprocessed library with respect to size, coverage, and diversity criteria and combines them into a single consensus solution, for which the true number of clusters is assumed to be unknown. We conduct experiments on different benchmark data sets. The results show that our approach yields more accurate consensus solutions compared to full-ensemble and the existing approaches for most data sets. We also present an application on the customer segmentation problem, where our approach is used to segment customers and to find a consensus solution for each segment, simultaneously.
Original languageEnglish
Pages (from-to)1065-1077
Number of pages13
JournalEuropean Journal of Operational Research
Volume314
Issue number3
Early online date16 Jan 2024
DOIs
Publication statusPublished - 1 May 2024

Keywords

  • Multiple objective programming
  • cluster ensembles
  • ensemble selection
  • consensus clustering

Fingerprint

Dive into the research topics of 'Cluster ensemble selection and consensus clustering: a multi-objective optimization approach'. Together they form a unique fingerprint.

Cite this