Back to Search Start Over

Using text data instead of SIC codes to tag innovative firms and classify industrial activities.

Authors :
Alessandro Marra
Cristiano Baldassari
Source :
PLoS ONE, Vol 17, Iss 6, p e0270041 (2022)
Publication Year :
2022
Publisher :
Public Library of Science (PLoS), 2022.

Abstract

The paper uses text mining and semantic algorithms to tag innovative firms and offer an alternative perspective to classify industrial activities. Instead of referring to firms' standard industrial classification codes, we gather information from companies' websites and corporate purposes, extract keywords and generate tags concerning firms' activities, specializations, and competences. Evidence is interesting because allows us to understand 'what firms do' in a more penetrating and updated way than referring to standard industrial classification codes. Moreover, through matching firms' keywords, we can explore the degree of closeness between the firms under observation, a measure by which researchers can derive industrial proximity. The analysis can provide policymakers with a detailed and comprehensive picture of the innovative trajectories underlying the industrial structure in a geographic area.

Subjects

Subjects :
Medicine
Science

Details

Language :
English
ISSN :
19326203
Volume :
17
Issue :
6
Database :
Directory of Open Access Journals
Journal :
PLoS ONE
Publication Type :
Academic Journal
Accession number :
edsdoj.160f000a58f54de8854ce8ec6158ba43
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pone.0270041