Search

Your search keyword '"Wang, Yaohui"' showing total 32 results

Search Constraints

Start Over You searched for: Author "Wang, Yaohui" Remove constraint Author: "Wang, Yaohui" Database arXiv Remove constraint Database: arXiv
32 results on '"Wang, Yaohui"'

Search Results

1. Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

2. Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

3. DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

4. Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion

5. 4Diffusion: Multi-view Video Diffusion Model for 4D Generation

6. DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

7. Image Reconstruction with B0 Inhomogeneity using an Interpretable Deep Unrolled Network on an Open-bore MRI-Linac

8. Latte: Latent Diffusion Transformer for Video Generation

9. DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

10. Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model

11. EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion

12. SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

13. ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation

14. LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

15. LAC: Latent Action Composition for Skeleton-based Action Segmentation

16. WeldMon: A Cost-effective Ultrasonic Welding Machine Condition Monitoring System

17. InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

18. AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

19. Self-Supervised Video Representation Learning via Latent Time Navigation

20. LEO: Generative Latent Image Animator for Human Video Synthesis

21. Long-Term Rhythmic Video Soundtracker

22. Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation

23. Learning Invariance from Generated Variance for Unsupervised Person Re-identification

24. 3D-EPI Blip-Up/Down Acquisition (BUDA) with CAIPI and Joint Hankel Structured Low-Rank Reconstruction for Rapid Distortion-Free High-Resolution T2* Mapping

25. ViA: View-invariant Skeleton Action Representation Learning via Motion Retargeting

26. Latent Image Animator: Learning to Animate Images via Latent Space Navigation

27. UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition

28. Fast Outage Analysis of Large-scale Production Clouds with Service Correlation Mining

29. InMoDeGAN: Interpretable Motion Decomposition Generative Adversarial Network for Video Generation

30. Joint Generative and Contrastive Learning for Unsupervised Person Re-identification

31. Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos

32. G3AN: Disentangling Appearance and Motion for Video Generation

Catalog

Books, media, physical & digital resources