Search

Your search keyword '"Feng, Jiashi"' showing total 923 results

Search Constraints

Start Over You searched for: Author "Feng, Jiashi" Remove constraint Author: "Feng, Jiashi"
923 results on '"Feng, Jiashi"'

Search Results

1. Hierarchical Memory for Long Video QA

2. Depth Anything V2

3. Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams

4. Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations

5. DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

6. InstaDrag: Lightning Fast and Accurate Drag-based Image Editing Emerging from Videos

7. PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator

8. StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

9. PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning

10. Magic-Boost: Boost 3D Generation with Mutli-View Conditioned Diffusion

11. Magic-Me: Identity-Specific Video Customized Diffusion

12. Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

13. MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation

15. Harnessing Diffusion Models for Visual Perception with Meta Prompts

16. Video Recognition in Portrait Mode

17. DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation

18. Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method

19. Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens

20. PixelLM: Pixel Reasoning with Large Multimodal Model

21. XAGen: 3D Expressive Human Avatars Generation

22. MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration

23. ChatAnything: Facetime Chat with LLM-Enhanced Personas

24. EPIM: Efficient Processing-In-Memory Accelerators based on Epitome

25. Low-Resolution Self-Attention for Semantic Segmentation

26. GETAvatar: Generative Textured Meshes for Animatable Human Avatars

28. MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask

29. MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation

30. MagicAvatar: Multimodal Avatar Generation and Animation

31. MagicEdit: High-Fidelity and Temporally Coherent Video Editing

32. Dataset Quantization

33. AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

34. BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

35. COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

36. Delving Deeper into Data Scaling in Masked Image Modeling

37. VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending

38. Associating Spatially-Consistent Grouping with Text-supervised Semantic Segmentation

39. DOAD: Decoupled One Stage Action Detection Network

40. OmniAvatar: Geometry-Guided Controllable 3D Head Synthesis

41. AgileGAN3D: Few-Shot 3D Portrait Stylization by Augmented Transfer Learning

42. TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision

43. Global Knowledge Calibration for Fast Open-Vocabulary Segmentation

44. Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring

45. MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval

46. Temporal Perceiving Video-Language Pre-training

47. CMAE-V: Contrastive Masked Autoencoders for Video Action Recognition

48. Class Prototype-based Cleaner for Label Noise Learning

49. PV3D: A 3D Generative Model for Portrait Video Generation

50. Diffusion Probabilistic Model Made Slim

Catalog

Books, media, physical & digital resources