Back to Search Start Over

Automated pipeline for superalloy data by text mining

Authors :
Weiren Wang
Xue Jiang
Shaohan Tian
Pei Liu
Depeng Dang
Yanjing Su
Turab Lookman
Jianxin Xie
Source :
npj Computational Materials, Vol 8, Iss 1, Pp 1-12 (2022)
Publication Year :
2022
Publisher :
Nature Portfolio, 2022.

Abstract

Abstract Data provides a foundation for machine learning, which has accelerated data-driven materials design. The scientific literature contains a large amount of high-quality, reliable data, and automatically extracting data from the literature continues to be a challenge. We propose a natural language processing pipeline to capture both chemical composition and property data that allows analysis and prediction of superalloys. Within 3 h, 2531 records with both composition and property are extracted from 14,425 articles, covering γ′ solvus temperature, density, solidus, and liquidus temperatures. A data-driven model for γ′ solvus temperature is built to predict unexplored Co-based superalloys with high γ′ solvus temperatures within a relative error of 0.81%. We test the predictions via synthesis and characterization of three alloys. A web-based toolkit as an online open-source platform is provided and expected to serve as the basis for a general method to search for targeted materials using data extracted from the literature.

Details

Language :
English
ISSN :
20573960
Volume :
8
Issue :
1
Database :
Directory of Open Access Journals
Journal :
npj Computational Materials
Publication Type :
Academic Journal
Accession number :
edsdoj.63cfae138d74aa7a69d180f199b83c2
Document Type :
article
Full Text :
https://doi.org/10.1038/s41524-021-00687-2