Back to Search Start Over

E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models

Authors :
Tan, Zhiyu
Qian, WenXu
Chen, Hesen
Yang, Mengping
Chen, Lei
Li, Hao
Publication Year :
2024

Abstract

Diffusion models have emerged as a powerful framework for generative modeling, achieving state-of-the-art performance across various tasks. However, they face several inherent limitations, including a training-sampling gap, information leakage in the progressive noising process, and the inability to incorporate advanced loss functions like perceptual and adversarial losses during training. To address these challenges, we propose an innovative end-to-end training framework that aligns the training and sampling processes by directly optimizing the final reconstruction output. Our method eliminates the training-sampling gap, mitigates information leakage by treating the training process as a direct mapping from pure noise to the target data distribution, and enables the integration of perceptual and adversarial losses into the objective. Extensive experiments on benchmarks such as COCO30K and HW30K demonstrate that our approach consistently outperforms traditional diffusion models, achieving superior results in terms of FID and CLIP score, even with reduced sampling steps. These findings highlight the potential of end-to-end training to advance diffusion-based generative models toward more robust and efficient solutions.<br />Comment: technical report, to be further updated

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2412.21044
Document Type :
Working Paper