Search

Your search keyword '"Fu, Yun"' showing total 5,127 results

Search Constraints

Start Over You searched for: Author "Fu, Yun" Remove constraint Author: "Fu, Yun"
5,127 results on '"Fu, Yun"'

Search Results

1. Accessing Vision Foundation Models at ImageNet-level Costs

2. SoupLM: Model Integration in Large Language and Multi-Modal Models

3. Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models

4. Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT

5. Phased Consistency Model

6. Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent

7. Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering

8. Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance

9. Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement

10. OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising

11. Adapting to Length Shift: FlexiLength Network for Trajectory Prediction

12. Rewrite the Stars

13. Efficient Modulation for Vision Networks

14. Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

15. Don't Judge by the Look: Towards Motion Coherent Video Representation

16. AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning

17. Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

18. VaQuitA: Enhancing Alignment in LLM-Assisted Video Understanding

20. Exploring Question Decomposition for Zero-Shot VQA

21. Layout Sequence Prediction From Noisy Mobile Modality

22. Latent Graph Inference with Limited Supervision

23. Camouflaged Image Synthesis Is All You Need to Boost Camouflaged Detection

24. BEV-DG: Cross-Modal Learning under Bird's-Eye View for Domain Generalization of 3D Semantic Segmentation

25. Citing as an Online Learning Support Tool for Student-Generated Assessment

26. A Systematic Review of Published Student Question-Generation Systems: Supporting Functionalities and Design Features

30. Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!

31. SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds

32. Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising

33. UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild

34. Uncovering the Missing Pattern: Unified Framework Towards Trajectory Imputation and Prediction

35. Frame Flexible Network

36. Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning

37. GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation

38. Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution

39. Image as Set of Points

40. Explainable Anomaly Detection in Images and Videos: A Survey

43. Making Reconstruction-based Method Great Again for Video Anomaly Detection

44. Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning

45. An Automatic Method for Generating Symbolic Expressions of Zernike Circular Polynomials

46. A Close Look at Spatial Modeling: From Attention to Convolution

47. CDIO-CT collaborative strategy for solving complex STEM problems in system modeling and simulation: an illustration of solving the period of mathematical pendulum

48. Real-Time Neural Light Field on Mobile Devices

49. NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-shot Real Image Animation

50. Look More but Care Less in Video Recognition

Catalog

Books, media, physical & digital resources