1. Keyword-based Uyghur Document Image Retrieval
- Author
-
Kurban Ubul, Yali Zhu, Wenjie Zhou, Alimjan Aysa, and Xuebin Xu
- Subjects
Morphological gradient ,Feature data ,Computer science ,business.industry ,Text segmentation ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Pattern recognition ,Feature (computer vision) ,Classifier (linguistics) ,Word recognition ,Artificial intelligence ,Document retrieval ,business ,Image retrieval - Abstract
In order to solve the problem of printed Uyghur document image retrieval, a two-level matching method from coarse to fine is proposed. Without word recognition, the character of word image is directly used for fast and accurate document retrieval. Firstly, 100 Uyghur document images are segmented into word sets by using the improved morphological gradient algorithm, and then the contour matching of 8 keyword images is performed by using the contour moment feature to obtain the query result set. Finally, MB-LBP and other features are extracted from the word images in the set. The trained OCSVM classifier is used to classify the feature data, and the result is returned to the user. The experimental results of 8 keyword images in 100 document images show that the average accuracy is 64% and the average recall rate is 49%. The experimental results show that efficiency of the keyword-based retrieval method for Uyghur document image.
- Published
- 2020
- Full Text
- View/download PDF