Text-based paper-level classification procedure for non-traditional sciences using a machine learning approach.

Authors :
Moctezuma, Daniela
López-Vázquez, Carlos
Lopes, Lucas
Trevisan, Norton
Pérez, José
Source :
Knowledge & Information Systems; Feb2024, Vol. 66 Issue 2, p1503-1520, 18p
Publication Year :
2024

Abstract

Science as a whole is organized into broad fields, and as a consequence, research, resources, students, etc., are also classified, assigned, or invited following a similar structure. Some fields have been established for centuries, and others are just beginning to flourish. Funding, staff, etc., to support a field are offered if there is some activity in it, commonly measured in terms of the number of published scientific papers. How can those papers be found? There exist well-respected listings where scientific journals are ascribed to one or more knowledge fields. Such lists are human-made, but the complexity begins when a field covers more than one area of knowledge. How can one discern whether a particular paper is devoted to a field not considered in such lists? In this work, we propose a methodology able to classify the universe of papers into two classes: those belonging to the field of interest, and those that do not. The proposed procedure learns from the title and abstract of papers published in monothematic or "pure" journals. Provided that such journals exist, the procedure could be applied to any field of knowledge. We tested the process with Geographic Information Science. The field overlaps with Computer Science, Mathematics, Cartography, and others, a fact which makes the task very difficult. We also tested our procedure and analyzed its results with three different criteria, illustrating its power and capabilities. Our proposed solution reached results similar to those of human taggers and comparable to state-of-the-art related work. [ABSTRACT FROM AUTHOR]
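
The two-class idea described in the abstract (learn from the titles and abstracts of papers in "pure" journals, then label any paper as inside or outside the field) can be illustrated with a minimal sketch. The abstract does not say which features or classifier the authors used, so everything below is an assumption made for illustration: the bag-of-words multinomial Naive Bayes model, the class labels in_field and out_of_field, and the four invented training texts are hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesPaperClassifier:
    """Multinomial Naive Bayes over title + abstract tokens.

    Illustrative stand-in only; the abstract does not name the
    classifier actually used by the authors.
    """

    def fit(self, docs, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for doc, label in zip(docs, labels):
            self.word_counts[label].update(tokenize(doc))
        # Vocabulary is the union of all tokens seen in training.
        self.vocab = set().union(*self.word_counts.values())
        return self

    def predict(self, doc):
        # Tokens never seen in training carry no evidence and are skipped.
        tokens = [t for t in tokenize(doc) if t in self.vocab]
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for c in self.classes:
            total_words = sum(self.word_counts[c].values())
            # Log prior plus Laplace-smoothed log likelihoods.
            score = math.log(self.doc_counts[c] / total_docs)
            for t in tokens:
                count = self.word_counts[c][t] + 1
                score += math.log(count / (total_words + len(self.vocab)))
            scores[c] = score
        return max(scores, key=scores.get)


# Hypothetical toy data: each item is "title. abstract" plus a label.
# "in_field" stands for papers from pure journals of the target field.
train = [
    ("Spatial interpolation of map data. We study geographic raster "
     "maps and spatial analysis.", "in_field"),
    ("Cartographic generalization for spatial databases. Geographic "
     "features on maps are simplified.", "in_field"),
    ("Protein folding dynamics. We simulate molecular structures of "
     "proteins.", "out_of_field"),
    ("Clinical trial of a new drug. Patients received treatment in a "
     "hospital.", "out_of_field"),
]

docs, labels = zip(*train)
clf = NaiveBayesPaperClassifier().fit(docs, labels)

print(clf.predict("Spatial analysis of geographic maps"))
print(clf.predict("Drug treatment for hospital patients"))
```

In practice the positive class would be built from every paper in the chosen pure journals and the negative class from a broad sample of other literature. The toy corpus here is far too small to say anything about accuracy.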

Details

Language :
English
ISSN :
02191377
Volume :
66
Issue :
2
Database :
Complementary Index
Journal :
Knowledge & Information Systems
Publication Type :
Academic Journal
Accession number :
174839321
Full Text :
https://doi.org/10.1007/s10115-023-02023-0