Fuzzy rule based systems for gender classification from blog data

Han Liu, Mihaela Cocea

Research output: Chapter in Book/Report/Conference proceedingConference contribution

99 Downloads (Pure)

Abstract

Gender classification is a popular machine learning task, which has been involved in various application areas, such as business intelligence, access control and cyber security. In the context of information granulation, gender related information can be divided into three types, namely, biological information, vision based information and social network based information.
In traditional machine learning, gender identification has been typically treated as a discriminative classification task, i.e. it is aimed at learning a classifier that discriminates between male and female. In this paper, we argue that it is not always appropriate to identify gender in the way of discriminative classification,
especially when considering the case that both male and female people are of high diversity and thus individuals of different genders could have high similarity to each other in terms of their characteristics. In order to address the above issue, we propose the use of a fuzzy approach for generative classification of gender. In particular, we focus on gender classification based
on social network information. We conduct an experiment study by using a blog data set, and compare the fuzzy approach with C4.5, Naive Bayes and Support Vector Machine in terms of classification performance. The results show that the fuzzy approach outperforms the other approaches and is also capable of capturing the diversity of both male and female people and dealing with the fuzziness in terms of gender identification.
Original languageEnglish
Title of host publicationThe 10th International Conference on Advanced Computational Intelligence 2018
PublisherIEEE
Pages79-84
Number of pages6
ISBN (Electronic)978-1-5386-4362-4
ISBN (Print)978-1-5386-4363-1
DOIs
Publication statusPublished - 11 Jun 2018
Event10th International Conference on Advanced Computational Intelligence: ICACI 2018 - Xiamen, China
Duration: 29 Mar 201831 Mar 2018
http://www.icaci2018.org/

Conference

Conference10th International Conference on Advanced Computational Intelligence
CountryChina
CityXiamen
Period29/03/1831/03/18
Internet address

Keywords

  • Data Mining
  • Machine Learning
  • fuzzy rule based systems
  • Text Classification
  • Gender Classification

Fingerprint

Dive into the research topics of 'Fuzzy rule based systems for gender classification from blog data'. Together they form a unique fingerprint.

Cite this