Sorry, I don't understand your search. ×
Back to Search Start Over

A Double Parametric Bootstrap Test for Topic Models

Authors :
Seto, Skyler
Tan, Sarah
Hooker, Giles
Wells, Martin T.
Publication Year :
2017

Abstract

Non-negative matrix factorization (NMF) is a technique for finding latent representations of data. The method has been applied to corpora to construct topic models. However, NMF has likelihood assumptions which are often violated by real document corpora. We present a double parametric bootstrap test for evaluating the fit of an NMF-based topic model based on the duality of the KL divergence and Poisson maximum likelihood estimation. The test correctly identifies whether a topic model based on an NMF approach yields reliable results in simulated and real data.<br />Comment: Presented at NIPS 2017 Symposium on Interpretable Machine Learning

Subjects

Subjects :
Statistics - Machine Learning

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.1711.07104
Document Type :
Working Paper