Back to Search Start Over

Distance Measures and Smoothing Methodology for Imputing Features of Documents.

Authors :
Feuerverger, Andrey
Hall, Peter
Tilahun, Gelila
Gervers, Michael
Source :
Journal of Computational & Graphical Statistics. Jun2005, Vol. 14 Issue 2, p255-262. 8p.
Publication Year :
2005

Abstract

We suggest a new class of metrics for measuring distances between documents, generalizing the well-known resemblance distance. We then show how to combine distance measures with statistical smoothing to develop techniques for imputing missing features of documents. We treat in detail the case where these features are continuous variates, but we note that our methods can be adapted to settings where the features are ordered or unordered categorical variates (e.g., the names of potential authors of the documents). The results of applying our ideas to the dating of medieval manuscripts are briefly summarized. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10618600
Volume :
14
Issue :
2
Database :
Academic Search Index
Journal :
Journal of Computational & Graphical Statistics
Publication Type :
Academic Journal
Accession number :
17276764
Full Text :
https://doi.org/10.1198/106186005X47291