Align vision-language semantics by multi-task learning for multi-modal summarization.
MLA
Cui, Chenhao, et al. “Align Vision-Language Semantics by Multi-Task Learning for Multi-Modal Summarization.” Neural Computing & Applications, vol. 36, no. 25, Sept. 2024, pp. 15653–66. EBSCOhost, https://doi.org/10.1007/s00521-024-09908-3.
APA
Cui, C., Liang, X., Wu, S., & Li, Z. (2024). Align vision-language semantics by multi-task learning for multi-modal summarization. Neural Computing & Applications, 36(25), 15653–15666. https://doi.org/10.1007/s00521-024-09908-3
Chicago
Cui, Chenhao, Xinnian Liang, Shuangzhi Wu, and Zhoujun Li. 2024. “Align Vision-Language Semantics by Multi-Task Learning for Multi-Modal Summarization.” Neural Computing & Applications 36 (25): 15653–66. https://doi.org/10.1007/s00521-024-09908-3.