Back to Search Start Over

Modular and Parameter-Efficient Multimodal Fusion with Prompting

Authors :
Liang, Sheng
Zhao, Mengjie
Schütze, Hinrich
Publication Year :
2022

Abstract

Recent research has made impressive progress in large-scale multimodal pre-training. In the context of the rapid growth of model size, it is necessary to seek efficient and flexible methods other than finetuning. In this paper, we propose to use prompt vectors to align the modalities. Our method achieves comparable performance to several other multimodal fusion methods in low-resource settings. We further show that our method is modular and parameter-efficient for processing tasks involving two or more data modalities.<br />Comment: Accepted to Findings of ACL 2022

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2203.08055
Document Type :
Working Paper