Search

Your search keyword '"Zhao, Hengshuang"' showing total 280 results

Search Constraints

Start Over You searched for: Author "Zhao, Hengshuang" Remove constraint Author: "Zhao, Hengshuang"
280 results on '"Zhao, Hengshuang"'

Search Results

1. Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning

2. One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection

3. UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation

4. VIRT: Vision Instructed Transformer for Robotic Manipulation

5. EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

6. LION: Linear Group RNN for 3D Object Detection in Point Clouds

7. Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation

8. ViLLa: Video Reasoning Segmentation with Large Language Model

9. LogoSticker: Inserting Logos into Diffusion Models for Customized Generation

10. OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

11. HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models

12. Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images

13. Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

14. Depth Anything V2

15. Zero-shot Image Editing with Reference Imitation

16. LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence

17. OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation

18. Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting

19. OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation

20. GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

21. Towards Unified 3D Object Detection via Algorithm and Data Unification

22. OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding

23. Memory Consistency Guided Divide-and-Conquer Learning for Generalized Category Discovery

24. Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

25. Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

26. Self-supervised Learning for Enhancing Geometrical Modeling in 3D-Aware Generative Adversarial Network

27. Point Transformer V3: Simpler, Faster, Stronger

28. VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation

29. CorresNeRF: Image Correspondence Priors for Neural Radiance Fields

30. TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation

31. DreamComposer: Controllable 3D Object Generation via Multi-View Conditions

32. GPT4Point: A Unified Framework for Point-Language Understanding and Generation

33. LivePhoto: Real Image Animation with Text-guided Motion Control

34. OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation

35. Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

36. LivePhoto: Real Image Animation with Text-Guided Motion Control

37. A Lightweight Clustering Framework for Unsupervised Semantic Segmentation

38. Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding

39. FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models

40. PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

41. UniPAD: A Universal Pre-training Paradigm for Autonomous Driving

42. Uni3DETR: Unified 3D Detection Transformer

43. DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model

44. OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation

45. Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training

46. InsMapper: Exploring Inner-instance Information for Vectorized HD Mapping

47. Shrinking Class Space for Enhanced Certainty in Semi-Supervised Learning

48. AnyDoor: Zero-shot Object-level Image Customization

49. GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping

50. SAM3D: Segment Anything in 3D Scenes

Catalog

Books, media, physical & digital resources