Back to Search Start Over

Create distinctive databases of ancient languages and using a computer vision model to accurately recognize and classify them.

Authors :
Saeed EA
Jasim AD
Abdul Malik MA
Source :
Data in brief [Data Brief] 2024 Aug 10; Vol. 56, pp. 110809. Date of Electronic Publication: 2024 Aug 10 (Print Publication: 2024).
Publication Year :
2024

Abstract

Cuneiform writing, an old art style, allows us to see into the past. Aside from Egyptian hieroglyphs, the cuneiform script is one of the oldest writing systems. It emerged in the second half of the fourth millennium BC. Most people believe that the Sumerians originally created it in southern Mesopotamia. Many historians place Hebrew's origins in antiquity. For example, we used the same approach to decipher the cuneiform languages; after learning how to decipher one old language, we would visit an archaeologist to learn how to decipher any other ancient language. We propose a deep-learning-based sign detector method to speed up this procedure to identify and group cuneiform tablet images according to Hebrew letter content. The Hebrew alphabet is notoriously difficult and costly to gather the training data needed for deep learning, which entails enclosing Hebrew characters in boxes. We solve this problem by using pre-existing transliterations and a sign-by-sign representation of the tablet's content in Latin characters. We recommend one of the supervised approaches because these do not include sign localization: We Find the transliteration signs in the tablet photographs by comparing them to their corresponding transliterations. Then, retrain the sign detector using these localized signs instead of utilizing annotations. Afterward, a more effective sign detector enhances the alignment quality. Consequently, this research aims to use the Yolov8 object identification pretraining model to identify Hebrew characters and categorize the cuneiform tablets. Images illustrating Hebrew passages have been culled from a Hebrew-language book. This book is known as the Old Testament, and it was organized into around 500 illustrations to aid in reading and pronouncing the characters. Another ancient document was recently discovered in Iraq, dating back to 500. It reached over a thousand photos after pre-processing and augmentation. The cuneiform digital library initiative (CDLI) website and the Iraqi Museum have compiled photographs of cuneiform tablets, with over a thousand photos available in each language.<br /> (© 2024 The Author(s).)

Details

Language :
English
ISSN :
2352-3409
Volume :
56
Database :
MEDLINE
Journal :
Data in brief
Publication Type :
Academic Journal
Accession number :
39257691
Full Text :
https://doi.org/10.1016/j.dib.2024.110809