Back to Search Start Over

Architectures of Meaning, A Systematic Corpus Analysis of NLP Systems

Authors :
Oskar Wysocki
Malina Florea
Dónal Landers
Andre Freitas
Source :
University of Manchester-PURE
Publication Year :
2021

Abstract

This paper proposes a novel statistical corpus analysis framework targeted towards the interpretation of Natural Language Processing (NLP) architectural patterns at scale. The proposed approach combines saturation-based lexicon construction, statistical corpus analysis methods and graph collocations to induce a synthesis representation of NLP architectural patterns from corpora. The framework is validated in the full corpus of Semeval tasks and demonstrated coherent architectural patterns which can be used to answer architectural questions on a data-driven fashion, providing a systematic mechanism to interpret a largely dynamic and exponentially growing field.<br />20 pages, 6 figures, 9 supplementary figures, Lexicon.txt in the appendix

Details

Language :
English
Database :
OpenAIRE
Journal :
University of Manchester-PURE
Accession number :
edsair.doi.dedup.....64c0d2295c45708e2aee0a5f7388ec00