Back to Search Start Over

Rule-based Search in Text Databases with Nonstandard Orthography.

Authors :
Pilz, Thomas
Luther, Wolfram
Fuhr, Norbert
Ammon, Ulrich
Source :
Literary & Linguistic Computing. Jun2006, Vol. 21 Issue 2, p179-186. 8p. 1 Diagram, 1 Chart, 1 Graph.
Publication Year :
2006

Abstract

In this article, we describe our interdisciplinary project ‘Rule-based search in text databases with nonstandard orthography (RSNSR)’ in support of the conservation of cultural heritage, especially for the German reception of the philosopher Nietzsche. We present a rule-based fuzzy search engine that allows users to retrieve text data independently of its orthographical realization. The rules used are derived from statistical analyses, historical publications, linguistic principles, and expert knowledge. Our Web-based tool is intended for experts as well as interested amateurs. Along with its present features, further functions are currently worked out. Among them are automatic rule derivation and finer result classification through a generalized Levenshtein similarity measure. Our work is associated with the recently launched project Deutsch Diachron Digital (DDD) to build a complete diachronic corpus of German for the first time with texts from the ninth century (Old High German) to the present (Modern German). [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02681145
Volume :
21
Issue :
2
Database :
Academic Search Index
Journal :
Literary & Linguistic Computing
Publication Type :
Academic Journal
Accession number :
21862116
Full Text :
https://doi.org/10.1093/llc/fql020