Back to Search Start Over

Data Quality May Be All You Need.

Authors :
Edwards, Chris
Source :
Communications of the ACM. Jul2024, Vol. 67 Issue 7, p8-10. 3p.
Publication Year :
2024

Abstract

The article discusses data quality and considers the building and scaling of data through open-source models. It mentions information duplication, degraded model quality, and data filtering in large language models (LLMs). Several research papers are cited including one titled "Textbooks are all you need" which focuses on Phi1, a transformer-based model which synthetically generates high-quality textbooks using output from sources such as GPT-3.5.

Details

Language :
English
ISSN :
00010782
Volume :
67
Issue :
7
Database :
Academic Search Index
Journal :
Communications of the ACM
Publication Type :
Periodical
Accession number :
178212130
Full Text :
https://doi.org/10.1145/3647631