Granular approximations: a novel statistical learning approach for handling data inconsistency with respect to a fuzzy relation

Marko Palangetić*, Chris Cornelis, Salvatore Greco, Roman Słowiński

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

15 Downloads (Pure)

Abstract

Inconsistency in classification and regression problems occurs when instances that relate in a certain way on the condition attributes, do not follow the same relation on the decision attribute. It typically appears as a result of perturbation in data caused by incomplete knowledge (missing attributes) or by random effects that occur during data generation (instability in the assessment of decision attribute values). Inconsistencies with respect to a crisp preorder relation (expressing either dominance or indiscernibility between instances) can be handled with set-theoretic approaches like rough sets and by using statistical/machine learning approaches that involve optimization methods. In particular, the Kotłowski-Słowiński (KS) approach relabels the objects from a dataset such that inconsistencies are removed, and such that the new class labels are as close as possible to the original ones in terms of a given loss function. In this paper, we generalize the KS approach to handle inconsistency determined by a fuzzy preorder relation rather than a crisp one. The method produces a consistent fuzzy relabeling of the instances and may be used as a preprocessing tool with algorithms for binary classification and regression. As the obtained fuzzy sets can be represented as unions of meaningful simple fuzzy sets or granules, we call them granular approximations. We provide statistical foundations for our method, develop appropriate optimization procedures, provide didactic examples, and prove several important properties.

Original languageEnglish
Pages (from-to)249-275
Number of pages27
JournalInformation Sciences
Volume629
Early online date8 Feb 2023
DOIs
Publication statusPublished - 1 Jul 2023

Keywords

  • Fuzzy logic
  • Inconsistencies in data
  • Rough sets
  • Statistical learning

Fingerprint

Dive into the research topics of 'Granular approximations: a novel statistical learning approach for handling data inconsistency with respect to a fuzzy relation'. Together they form a unique fingerprint.

Cite this