Back to Search Start Over

Multilingual person name recognition and transliteration

Multilingual person name recognition and transliteration

Authors :
Bruno Pouliquen
Ralf Steinberger
Camelia Ignat
Irina Temnikova
Anna Widiger
Source :
Corela, Vol 2 (2005)
Publication Year :
2005
Publisher :
Cercle linguistique du Centre et de l'Ouest - CerLICO, 2005.

Abstract

We present an exploratory tool that extracts person names from multilingual news collections, matches name variants referring to the same person, and infers relationships between people based on the co-occurrence of their names in related news. A novel feature is the matching of name variants across languages and writing systems, including names written with the Greek, Cyrillic and Arabic writing system. Due to our highly multilingual setting, we use an internal standard representation for name representation and matching, instead of adopting the traditional bilingual approach to transliteration. This work is part of a news analysis system that clusters an average of 25,000 news articles per day to detect related news within the same and across different languages.

Details

Language :
English, French
ISSN :
1638573X
Volume :
2
Database :
Directory of Open Access Journals
Journal :
Corela
Publication Type :
Academic Journal
Accession number :
edsdoj.8783bd069b41dfa9966fbdcd68045b
Document Type :
article
Full Text :
https://doi.org/10.4000/corela.1219