Back to Search
Start Over
A Baybayin word recognition system
- Source :
- PeerJ Computer Science, PeerJ Computer Science, Vol 7, p e596 (2021)
- Publication Year :
- 2021
- Publisher :
- PeerJ Inc., 2021.
-
Abstract
- Baybayin is a pre-Hispanic Philippine writing system used in Luzon island. With the effort in reintroducing the script, in 2018, the Committee on Basic Education and Culture of the Philippine Congress approved House Bill 1022 or the ”National Writing System Act,” which declares the Baybayin script as the Philippines’ national writing system. Since then, Baybayin OCR has become a field of research interest. Numerous works have proposed different techniques in recognizing Baybayin scripts. However, all those studies anchored on the classification and recognition at the character level. In this work, we propose an algorithm that provides the Latin transliteration of a Baybayin word in an image. The proposed system relies on a Baybayin character classifier generated using the Support Vector Machine (SVM). The method involves isolation of each Baybayin character, then classifying each character according to its equivalent syllable in Latin script, and finally concatenate each result to form the transliterated word. The system was tested using a novel dataset of Baybayin word images and achieved a competitive 97.9% recognition accuracy. Based on our review of the literature, this is the first work that recognizes Baybayin scripts at the word level. The proposed system can be used in automated transliterations of Baybayin texts transcribed in old books, tattoos, signage, graphic designs, and documents, among others.
- Subjects :
- Support vector machine
General Computer Science
Computer science
Computer Vision
Baybayin
02 engineering and technology
computer.software_genre
Scientific Computing and Simulation
Classifier (linguistics)
0202 electrical engineering, electronic engineering, information engineering
Transliteration
Optimization Theory and Computation
business.industry
Latin script
020206 networking & telecommunications
Optical character recognition
QA75.5-76.95
Natural Language and Speech
Computational Linguistics
Character (mathematics)
Writing system
Electronic computers. Computer science
Word recognition
020201 artificial intelligence & image processing
Artificial intelligence
Baybayin word recognition
business
computer
Natural language processing
Word (computer architecture)
Subjects
Details
- Language :
- English
- ISSN :
- 23765992
- Volume :
- 7
- Database :
- OpenAIRE
- Journal :
- PeerJ Computer Science
- Accession number :
- edsair.doi.dedup.....12bfbd71639cd8d6754ed21e17ae8617