NELA-GT-2018: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles

Authors :
Norregaard, Jeppe
Horne, Benjamin D.
Adali, Sibel
Publication Year :
2019

Abstract

In this paper, we present a dataset of 713k articles collected between 02/2018 and 11/2018. These articles are collected directly from 194 news and media outlets, including mainstream, hyper-partisan, and conspiracy sources. We incorporate ground truth ratings of the sources from 8 different assessment sites covering multiple dimensions of veracity, including reliability, bias, transparency, adherence to journalistic standards, and consumer trust. The NELA-GT-2018 dataset can be found at https://doi.org/10.7910/DVN/ULHLCB.

Comment: Published at ICWSM 2019

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.1904.01546
Document Type :
Working Paper