Abstract
Questions Classification (QC) is one of the most popular text classification applications. QC plays an important role in question-answering systems. However, as in many real-world classification problems, QC may suffer from the problem of class imbalance. The classification of imbalanced data has been a key problem in machine learning and data mining. In this paper, we propose a framework that deals with the class imbalance using a hierarchical SMOTE algorithm for balancing different types of questions. The proposed framework is grammar-based, which involves using the grammatical pattern for each question and using machine learning algorithms to classify them. Experimental
results imply that the proposed framework demonstrates a good level of accuracy in identifying different question types and handling class imbalance.
results imply that the proposed framework demonstrates a good level of accuracy in identifying different question types and handling class imbalance.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2018 International Conference on Machine Learning and Cybernetics (ICMLC) |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 593-597 |
| Number of pages | 6 |
| Volume | 2 |
| ISBN (Electronic) | 978-1-5386-5214-5 |
| ISBN (Print) | 978-1-5386-5215-2 |
| DOIs | |
| Publication status | Published - 12 Nov 2018 |
| Event | 2018 International Conference on Machine Learning and Cybernetics - http://www.icmlc.com/icmlc/welcome.html, Chengdu, China Duration: 15 Jul 2018 → 18 Jul 2018 |
Publication series
| Name | International Conference on Machine Learning and Cybernetics (ICMLC) |
|---|---|
| Publisher | IEEE |
| ISSN (Print) | 2160-133X |
| ISSN (Electronic) | 2160-1348 |
Conference
| Conference | 2018 International Conference on Machine Learning and Cybernetics |
|---|---|
| Abbreviated title | ICMLC 2018 |
| Country/Territory | China |
| City | Chengdu |
| Period | 15/07/18 → 18/07/18 |
Keywords
- Information Retrieval
- Text classification
- Question classification
- Machine Learning
- Class Imbalance