Search

Your search keyword '"Kang, Bingyi"' showing total 257 results

Search Constraints

Start Over You searched for: Author "Kang, Bingyi" Remove constraint Author: "Kang, Bingyi"
257 results on '"Kang, Bingyi"'

Search Results

1. Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

2. VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

3. Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models

4. Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation

5. Image Understanding Makes for A Good Tokenizer for Image Generation

6. Classification Done Right for Vision-Language Pre-Training

7. How Far is Video Generation from World Model: A Physical Law Perspective

8. DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

9. Loong: Generating Minute-level Long Videos with Autoregressive Language Models

10. Depth Anything V2

11. Improving Token-Based World Models with Parallel Observation Prediction

12. Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

13. Harnessing Diffusion Models for Visual Perception with Meta Prompts

14. FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models

15. Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

16. BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

17. Decoupled Prioritized Resampling for Offline RL

18. Improving and Benchmarking Offline Reinforcement Learning Algorithms

19. Efficient Diffusion Policies for Offline Reinforcement Learning

20. MADiff: Offline Multi-agent Learning with Diffusion Models

21. Bag of Tricks for Training Data Extraction from Language Models

22. Boosting Offline Reinforcement Learning via Data Rebalancing

23. Mutual Information Regularized Offline Reinforcement Learning

24. Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning

29. Deep Long-Tailed Learning: A Survey

32. Refiner: Refining Self-attention for Vision Transformers

33. DeepViT: Towards Deeper Vision Transformer

35. Improving Generalization in Reinforcement Learning with Mixture Regularization

36. Few-shot Classification via Adaptive Attention

37. The Devil is in Classification: A Simple Framework for Long-tail Object Detection and Instance Segmentation

38. Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax

40. Classification Calibration for Long-tail Instance Segmentation

41. Exploring Simple and Transferable Recognition-Aware Image Processing

42. Decoupling Representation and Classifier for Long-Tailed Recognition

43. Regularization Matters in Policy Optimization

49. Similarity R-C3D for Few-shot Temporal Activity Detection

50. Few-shot Object Detection via Feature Reweighting

Catalog

Books, media, physical & digital resources