Back to Search Start Over

A Flexible Big Data System for Credibility-Based Filtering of Social Media Information According to Expertise

Authors :
Jose A. Diaz-Garcia
Karel GutiƩrrez-Batista
Carlos Fernandez-Basso
M. Dolores Ruiz
Maria J. Martin-Bautista
Source :
International Journal of Computational Intelligence Systems, Vol 17, Iss 1, Pp 1-16 (2024)
Publication Year :
2024
Publisher :
Springer, 2024.

Abstract

Abstract Nowadays, social networks have taken on an irreplaceable role as sources of information. Millions of people use them daily to find out about the issues of the moment. This success has meant that the amount of content present in social networks is unmanageable and, in many cases, fake or non-credible. Therefore, a correct pre-processing of the data is necessary if we want to obtain knowledge and value from these data sets. In this paper, we propose a new data pre-processing technique based on Big Data that seeks to solve two of the key concepts of the Big Data paradigm, data validity and credibility of the data and volume. The system is a Spark-based filter that allows us to flexibly select credible users related to a given topic under analysis, reducing the volume of data and keeping only valid data for the problem under study. The proposed system uses the power of word embeddings in conjunction with other text mining and natural language processing techniques. The system has been validated using three real-world use cases.

Details

Language :
English
ISSN :
18756883
Volume :
17
Issue :
1
Database :
Directory of Open Access Journals
Journal :
International Journal of Computational Intelligence Systems
Publication Type :
Academic Journal
Accession number :
edsdoj.70a712746bbe4814bea77f173172a78e
Document Type :
article
Full Text :
https://doi.org/10.1007/s44196-024-00483-y