Back to Search Start Over

A Machine Learning-based Method for Question Type Classification in Biomedical Question Answering.

Authors :
Kopanitsa, Georgy
Sarrouti, Mourad
Ouatik El Alaoui, Said
Source :
Methods of Information in Medicine; 2017, Vol. 56 Issue 3, p209-247, 8p
Publication Year :
2017

Abstract

<bold>Background and Objective: </bold>Biomedical question type classification is one of the important components of an automatic biomedical question answering system. The performance of the latter depends directly on the performance of its biomedical question type classification system, which consists of assigning a category to each question in order to determine the appropriate answer extraction algorithm. This study aims to automatically classify biomedical questions into one of the four categories: (1) yes/no, (2) factoid, (3) list, and (4) summary.<bold>Methods: </bold>In this paper, we propose a biomedical question type classification method based on machine learning approaches to automatically assign a category to a biomedical question. First, we extract features from biomedical questions using the proposed handcrafted lexico-syntactic patterns. Then, we feed these features for machine-learning algorithms. Finally, the class label is predicted using the trained classifiers.<bold>Results: </bold>Experimental evaluations performed on large standard annotated datasets of biomedical questions, provided by the BioASQ challenge, demonstrated that our method exhibits significant improved performance when compared to four baseline systems. The proposed method achieves a roughly 10-point increase over the best baseline in terms of accuracy. Moreover, the obtained results show that using handcrafted lexico-syntactic patterns as features' provider of support vector machine (SVM) lead to the highest accuracy of 89.40 %.<bold>Conclusion: </bold>The proposed method can automatically classify BioASQ questions into one of the four categories: yes/no, factoid, list, and summary. Furthermore, the results demonstrated that our method produced the best classification performance compared to four baseline systems. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00261270
Volume :
56
Issue :
3
Database :
Complementary Index
Journal :
Methods of Information in Medicine
Publication Type :
Academic Journal
Accession number :
123144976
Full Text :
https://doi.org/10.3414/ME16-01-0116