Back to Search Start Over

基于LDA和word2vec的英文作文跑题检测.

Authors :
曲 强
崔荣一
赵亚慧
Source :
Application Research of Computers / Jisuanji Yingyong Yanjiu. Feb2019, Vol. 36 Issue 2, p415-419. 5p.
Publication Year :
2019

Abstract

Aiming at the problem that the lack of accurate and efficient off-topic detection algorithm for the current English composition teaching system in China, this paper proposed an off-topic detection algorithm based on LDA and word2vec. The algorithm used LDA to model the documents and trained it with word2vec, with obtained semantic relation between document’s topic and words, calculated the probability weighted sum of each topic and its feature words in the document. Finally, by setting reasonable threshold, it selected the off-topic essays. According to the different F values for the different number of topics in the document, it determined the optimum number of topics in the experiment. The experimental results show that, compared to traditional vector space model, the proposed method can detect more off-topic essays with higher accuracy, and the F value is above 89%, which realizes the intelligent processing of off-topic essays detection, and may applies effectively in English essays teaching. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
10013695
Volume :
36
Issue :
2
Database :
Academic Search Index
Journal :
Application Research of Computers / Jisuanji Yingyong Yanjiu
Publication Type :
Academic Journal
Accession number :
135503019
Full Text :
https://doi.org/10.19734/j.issn.1001-3695.2017.08.0724