Abstract
Gender classification is a popular machine learning task, which has been involved in various application areas, such as business intelligence, access control and cyber security. In the context of information granulation, gender related information can be divided into three types, namely, biological information, vision based information and social network based information.
In traditional machine learning, gender identification has been typically treated as a discriminative classification task, i.e. it is aimed at learning a classifier that discriminates between male and female. In this paper, we argue that it is not always appropriate to identify gender in the way of discriminative classification,
especially when considering the case that both male and female people are of high diversity and thus individuals of different genders could have high similarity to each other in terms of their characteristics. In order to address the above issue, we propose the use of a fuzzy approach for generative classification of gender. In particular, we focus on gender classification based
on social network information. We conduct an experiment study by using a blog data set, and compare the fuzzy approach with C4.5, Naive Bayes and Support Vector Machine in terms of classification performance. The results show that the fuzzy approach outperforms the other approaches and is also capable of capturing the diversity of both male and female people and dealing with the fuzziness in terms of gender identification.
In traditional machine learning, gender identification has been typically treated as a discriminative classification task, i.e. it is aimed at learning a classifier that discriminates between male and female. In this paper, we argue that it is not always appropriate to identify gender in the way of discriminative classification,
especially when considering the case that both male and female people are of high diversity and thus individuals of different genders could have high similarity to each other in terms of their characteristics. In order to address the above issue, we propose the use of a fuzzy approach for generative classification of gender. In particular, we focus on gender classification based
on social network information. We conduct an experiment study by using a blog data set, and compare the fuzzy approach with C4.5, Naive Bayes and Support Vector Machine in terms of classification performance. The results show that the fuzzy approach outperforms the other approaches and is also capable of capturing the diversity of both male and female people and dealing with the fuzziness in terms of gender identification.
Original language | English |
---|---|
Title of host publication | The 10th International Conference on Advanced Computational Intelligence 2018 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 79-84 |
Number of pages | 6 |
ISBN (Electronic) | 978-1-5386-4362-4 |
ISBN (Print) | 978-1-5386-4363-1 |
DOIs | |
Publication status | Published - 11 Jun 2018 |
Event | 10th International Conference on Advanced Computational Intelligence - Xiamen, China Duration: 29 Mar 2018 → 31 Mar 2018 http://www.icaci2018.org/ |
Conference
Conference | 10th International Conference on Advanced Computational Intelligence |
---|---|
Abbreviated title | ICACI 2018 |
Country/Territory | China |
City | Xiamen |
Period | 29/03/18 → 31/03/18 |
Internet address |
Keywords
- Data Mining
- Machine Learning
- fuzzy rule based systems
- Text Classification
- Gender Classification