Back to Search Start Over

Machine learning misclassification of academic publications reveals non-trivial interdependencies of scientific disciplines

Authors :
Yilmaz Uygun
Marc-Thorsten Hütt
Alexey Lyutov
Source :
Scientometrics. 126:1173-1186
Publication Year :
2020
Publisher :
Springer Science and Business Media LLC, 2020.

Abstract

Exploring the production of knowledge with quantitative methods is the foundation of scientometrics. In an application of machine learning to scientometrics, we here consider the classification problem of the mapping of academic publications to the subcategories of a multidisciplinary journal—and hence to scientific disciplines—based on the information contained in the abstract. In contrast to standard classification tasks, we are not interested in maximizing the accuracy, but rather we ask, whether the failures of an automatic classification are systematic and contain information about the system under investigation. These failures can be represented as a ’misclassification network’ inter-relating scientific disciplines. Here we show that this misclassification network (1) gives a markedly different pattern of interdependencies among scientific disciplines than common ’maps of science’, (2) reveals a statistical association between misclassification and citation frequencies, and (3) allows disciplines to be classified as ’method lenders’ and ’content explorers’, based on their in-degree out-degree asymmetry. On a more general level, in a wide range of machine learning applications misclassification networks have the potential of extracting systemic information from the failed classifications, thus allowing to visualize and quantitatively assess those aspects of a complex system, which are not machine learnable.

Details

ISSN :
15882861 and 01389130
Volume :
126
Database :
OpenAIRE
Journal :
Scientometrics
Accession number :
edsair.doi...........2f79f53156ab67032e2982c92b5dbf29