Back to Search Start Over

NELasso: Group-Sparse Modeling for Characterizing Relations Among Named Entities in News Articles.

Authors :
Tariq, Amara
Karim, Asim
Foroosh, Hassan
Source :
IEEE Transactions on Pattern Analysis & Machine Intelligence. Oct2017, Vol. 39 Issue 10, p2000-2014. 15p.
Publication Year :
2017

Abstract

Named entities such as people, locations, and organizations play a vital role in characterizing online content. They often reflect information of interest and are frequently used in search queries. Although named entities can be detected reliably from textual content, extracting relations among them is more challenging, yet useful in various applications (e.g., news recommending systems). In this paper, we present a novel model and system for learning semantic relations among named entities from collections of news articles. We model each named entity occurrence with sparse structured logistic regression, and consider the words (predictors) to be grouped based on background semantics. This sparse group LASSO approach forces the weights of word groups that do not influence the prediction towards zero. The resulting sparse structure is utilized for defining the type and strength of relations. Our unsupervised system yields a named entities’ network where each relation is typed, quantified, and characterized in context. These relations are the key to understanding news material over time and customizing newsfeeds for readers. Extensive evaluation of our system on articles from TIME magazine and BBC News shows that the learned relations correlate with static semantic relatedness measures like WLM, and capture the evolving relationships among named entities over time. [ABSTRACT FROM PUBLISHER]

Details

Language :
English
ISSN :
01628828
Volume :
39
Issue :
10
Database :
Academic Search Index
Journal :
IEEE Transactions on Pattern Analysis & Machine Intelligence
Publication Type :
Academic Journal
Accession number :
125028213
Full Text :
https://doi.org/10.1109/TPAMI.2016.2632117