Search

Your search keyword '"Zhao, Hang"' showing total 138 results

Search Constraints

Start Over You searched for: Author "Zhao, Hang" Remove constraint Author: "Zhao, Hang" Publication Type Reports Remove constraint Publication Type: Reports
138 results on '"Zhao, Hang"'

Search Results

1. VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion

2. Explaining Context Length Scaling and Bounds for Language Models

3. Embrace Collisions: Humanoid Shadowing for Deployable Contact-Agnostics Motions

4. When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation

5. A Unit-based System and Dataset for Expressive Direct Speech-to-Speech Translation

6. Minimal subshifts of prescribed mean dimension over general alphabets

7. Advancing Single- and Multi-task Text Classification through Large Language Model Fine-tuning

8. Invariant tori for a class of affined Anosov mappings with quasi-periodic forces

9. ToxiLab: How Well Do Open-Source LLMs Generate Synthetic Toxicity Data?

10. Generalizing Motion Planners with Mixture of Experts for Autonomous Driving

11. Playful DoggyBot: Learning Agile and Precise Quadrupedal Locomotion

12. ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information

13. CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction

14. Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

15. Robust Robot Walker: Learning Agile Locomotion over Tiny Traps

16. SARO: Space-Aware Robot System for Terrain Crossing via Vision-Language Model

17. Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation

18. DISCO: Efficient Diffusion Solver for Large-Scale Combinatorial Optimization Problems

19. GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory

20. Humanoid Parkour Learning

21. TimeSieve: Extracting Temporal Dynamics through Information Bottlenecks

22. FTS: A Framework to Find a Faithful TimeSieve

23. Generating Comprehensive Lithium Battery Charging Data with Generative AI

24. P-MapNet: Far-seeing Map Generator Enhanced by both SDMap and HDMap Priors

25. PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors

26. DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models

27. MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning

28. PIXART-{\delta}: Fast and Controllable Image Generation with Latent Consistency Models

29. LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

30. Large Trajectory Models are Scalable Motion Predictors and Planners

31. LiDAR-based 4D Occupancy Completion and Forecasting

32. What Makes for Robust Multi-Modal Models in the Face of Missing Modalities?

33. Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments

34. Improving Discriminative Multi-Modal Learning with Large-Scale Pre-Trained Models

35. Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

36. GPT-Driver: Learning to Drive with GPT

37. Uncertainty-Aware Decision Transformer for Stochastic Driving Environments

38. AutoEncoding Tree for City Generation and Applications

39. Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills

40. Robot Parkour Learning

41. StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map Construction

42. Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals

43. Learning-based Control for PMSM Using Distributed Gaussian Processes with Optimal Aggregation Strategy

44. Reconstructing Three-decade Global Fine-Grained Nighttime Light Observations by a New Super-Resolution Framework

45. Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

46. BEVScope: Enhancing Self-Supervised Depth Estimation Leveraging Bird's-Eye-View in Dynamic Scenarios

47. A Universal Semantic-Geometric Representation for Robotic Manipulation

48. SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving

49. ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory

50. GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training

Catalog

Books, media, physical & digital resources