Back to Search
Start Over
Distance Measures and Smoothing Methodology for Imputing Features of Documents.
- Source :
-
Journal of Computational & Graphical Statistics . Jun2005, Vol. 14 Issue 2, p255-262. 8p. - Publication Year :
- 2005
-
Abstract
- We suggest a new class of metrics for measuring distances between documents, generalizing the well-known resemblance distance. We then show how to combine distance measures with statistical smoothing to develop techniques for imputing missing features of documents. We treat in detail the case where these features are continuous variates, but we note that our methods can be adapted to settings where the features are ordered or unordered categorical variates (e.g., the names of potential authors of the documents). The results of applying our ideas to the dating of medieval manuscripts are briefly summarized. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 10618600
- Volume :
- 14
- Issue :
- 2
- Database :
- Academic Search Index
- Journal :
- Journal of Computational & Graphical Statistics
- Publication Type :
- Academic Journal
- Accession number :
- 17276764
- Full Text :
- https://doi.org/10.1198/106186005X47291