Back to Search
Start Over
Data Quality May Be All You Need.
- Source :
-
Communications of the ACM . Jul2024, Vol. 67 Issue 7, p8-10. 3p. - Publication Year :
- 2024
-
Abstract
- The article discusses data quality and considers the building and scaling of data through open-source models. It mentions information duplication, degraded model quality, and data filtering in large language models (LLMs). Several research papers are cited including one titled "Textbooks are all you need" which focuses on Phi1, a transformer-based model which synthetically generates high-quality textbooks using output from sources such as GPT-3.5.
Details
- Language :
- English
- ISSN :
- 00010782
- Volume :
- 67
- Issue :
- 7
- Database :
- Academic Search Index
- Journal :
- Communications of the ACM
- Publication Type :
- Periodical
- Accession number :
- 178212130
- Full Text :
- https://doi.org/10.1145/3647631