Abstract
Questions Classification (QC) is one of the most popular text classification applications. QC plays an important role in question-answering systems. However, as in many real-world classification problems, QC may suffer from the problem of class imbalance. The classification of imbalanced data has been a key problem in machine learning and data mining. In this paper, we propose a framework that deals with the class imbalance using a hierarchical SMOTE algorithm for balancing different types of questions. The proposed framework is grammar-based, which involves using the grammatical pattern for each question and using machine learning algorithms to classify them. Experimental
results imply that the proposed framework demonstrates a good level of accuracy in identifying different question types and handling class imbalance.
results imply that the proposed framework demonstrates a good level of accuracy in identifying different question types and handling class imbalance.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2018 International Conference on Machine Learning and Cybernetics (ICMLC) |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 593-597 |
Number of pages | 6 |
Volume | 2 |
ISBN (Electronic) | 978-1-5386-5214-5 |
ISBN (Print) | 978-1-5386-5215-2 |
DOIs | |
Publication status | Published - 12 Nov 2018 |
Event | 2018 International Conference on Machine Learning and Cybernetics - http://www.icmlc.com/icmlc/welcome.html, Chengdu, China Duration: 15 Jul 2018 → 18 Jul 2018 |
Publication series
Name | International Conference on Machine Learning and Cybernetics (ICMLC) |
---|---|
Publisher | IEEE |
ISSN (Print) | 2160-133X |
ISSN (Electronic) | 2160-1348 |
Conference
Conference | 2018 International Conference on Machine Learning and Cybernetics |
---|---|
Abbreviated title | ICMLC 2018 |
Country/Territory | China |
City | Chengdu |
Period | 15/07/18 → 18/07/18 |
Keywords
- Information Retrieval
- Text classification
- Question classification
- Machine Learning
- Class Imbalance