Back to Search Start Over

Shortcut Learning of Large Language Models in Natural Language Understanding.

Authors :
MENGNAN DU
FENGXIANG HE
NA ZOU
DACHENG TAO
XIA HU
Source :
Communications of the ACM. Jan2024, Vol. 67 Issue 1, p110-120. 11p.
Publication Year :
2024

Abstract

The article looks at the use of large language models to carry out natural language understanding (NLU) tasks. It suggests that the shortcut learning common to existing large language models based on machine learning limits how robust their performance can be because they are overly dependent on spurious correlations and incidental relationships. It discusses possible approaches to overcoming this problem in the future development of large language models.

Details

Language :
English
ISSN :
00010782
Volume :
67
Issue :
1
Database :
Academic Search Index
Journal :
Communications of the ACM
Publication Type :
Periodical
Accession number :
174359746
Full Text :
https://doi.org/10.1145/3596490