Back to Search Start Over

A Collaborative AI-Enabled Pretrained Language Model for AIoT Domain Question Answering.

Authors :
Zhu, Hongyin
Tiwari, Prayag
Ghoneim, Ahmed
Hossain, M. Shamim
Source :
IEEE Transactions on Industrial Informatics; May2022, Vol. 18 Issue 5, p3387-3396, 10p
Publication Year :
2022

Abstract

Large-scale knowledge in the artificial intelligence of things (AIoT) field urgently needs effective models to understand human language and automatically answer questions. Pretrained language models achieve state-of-the-art performance on some question answering (QA) datasets, but few models can answer questions on AIoT domain knowledge. Currently, the AIoT domain lacks sufficient QA datasets and large-scale pretraining corpora. In this article, we propose RoBERTa $_{\mathrm AIoT}$ to address the problem of the lack of high-quality large-scale labeled AIoT QA datasets. We construct an AIoT corpus to further pretrain RoBERTa and BERT. RoBERTa $_{\mathrm AIoT}$ and BERT $_{\mathrm AIoT}$ leverage unsupervised pretraining on a large corpus composed of AIoT-oriented Wikipedia webpages to learn more domain-specific context and improve performance on the AIoT QA tasks. To fine-tune and evaluate the model, we construct three AIoT QA datasets based on the community QA websites. We evaluate our approach on these datasets, and the experimental results demonstrate the significant improvements of our approach. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15513203
Volume :
18
Issue :
5
Database :
Complementary Index
Journal :
IEEE Transactions on Industrial Informatics
Publication Type :
Academic Journal
Accession number :
155108394
Full Text :
https://doi.org/10.1109/TII.2021.3097183