Back to Search Start Over

PaperMiner—a real-time spatiotemporal visualization for newspaper articles.

Authors :
Kutty, Sangeetha
Nayak, Richi
Turnbull, Paul
Chernich, Ron
Kennedy, Gavin
Raymond, Kerry
Source :
Digital Scholarship in the Humanities; Apr2020, Vol. 35 Issue 1, p83-100, 18p, 4 Diagrams, 2 Charts, 1 Graph, 2 Maps
Publication Year :
2020

Abstract

In 2005, the National Library of Australia (NLA) began a pilot project to selectively digitize back issues of major Australian newspapers to provide free public access to over 60 million digitized newspaper articles, dating from the first years of Australian colonization to the early 1960s. Trove, a faceted search engine maintained by NLA, provides access to this very large collection. Unfortunately, Trove lacked any means to filter by location, which raised the tantalizing possibility of using advanced computational techniques to identify long-term patterns and trends in newspaper reportage of people, events, concepts, and many other historical entities. PaperMiner, which utilizes text mining techniques for extracting metadata information, was developed that enabled the inclusion of geolocations of the places cited in the newspaper articles and supported the searching of articles by location and visualizing the results of searches using both location and time using a map of Australia. Using PaperMiner, researchers could see when and where the anti-Chinese leagues movement started in Australia and how it spread, to better focus their subsequent research. PaperMiner can be used as a digital humanities tool to assist in research by replacing the tedium of a shallow scan through thousands of Trove search results with a more efficient method that draws the researchers' attention to more significant times and places where their time can be better spent in deeper analysis. In this article, we describe the techniques utilized in creating PaperMiner and discuss its usability testing with a group of leading researchers in Australian history. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
2055768X
Volume :
35
Issue :
1
Database :
Complementary Index
Journal :
Digital Scholarship in the Humanities
Publication Type :
Academic Journal
Accession number :
142636784
Full Text :
https://doi.org/10.1093/llc/fqy084