Back to Search Start Over

AI-based Chinese-style music generation from video content: a study on cross-modal analysis and generation methods

Authors :
Moxi Cao
Jiaxiang Zheng
Chongbin Zhang
Source :
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2025, Iss 1, Pp 1-23 (2025)
Publication Year :
2025
Publisher :
SpringerOpen, 2025.

Abstract

Abstract In recent years, Artificial Intelligence Generated Content (AIGC) technologies have advanced rapidly, with models such as Stable Diffusion and GPT garnering significant attention across various domains. Against this backdrop, AI-driven music composition techniques have also produced significant progress. However, no existing model has yet demonstrated the capability to generate Chinese-style music corresponding to Chinese-style videos. To address this gap, this study proposes a novel Chinese-style video music generation model based on the Latent Diffusion Model (LDM) and Diffusion Transformers (DiT). Experimental results demonstrate that the proposed model generates Chinese-style music from Chinese-style videos and achieves performance comparable to the baseline models in audio quality, distribution fitting, musicality, rhythmic stability, and audio-visual synchronization. These findings indicate that the model captures the stylistic features of Chinese music. This research not only demonstrates the feasibility applications of artificial intelligence in music creation but also provides a new technological approach to preserve and innovate the traditional Chinese music culture in the digital era. Furthermore, it explores new possibilities for the dissemination and innovation of Chinese cultural arts in the digital age.

Details

Language :
English
ISSN :
16874722
Volume :
2025
Issue :
1
Database :
Directory of Open Access Journals
Journal :
EURASIP Journal on Audio, Speech, and Music Processing
Publication Type :
Academic Journal
Accession number :
edsdoj.207b0f9e67b8427c8cbc00c37f742cca
Document Type :
article
Full Text :
https://doi.org/10.1186/s13636-025-00397-3