Your search for author "yang, Ming" returned 67,868 results in total

Search Results

1. Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS

2. Zeoformer: Coarse-Grained Periodic Graph Transformer for OSDA-Zeolite Affinity Prediction

3. Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration

4. Crystalline Material Discovery in the Era of Artificial Intelligence

5. Cropper: Vision-Language Model for Image Cropping through In-Context Learning

6. Social Debiasing for Fair Multi-modal LLMs

7. ControlNeXt: Powerful and Efficient Control for Image and Video Generation

8. Egocentric Vision Language Planning

9. ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planning

10. POA: Pre-training Once for Models of All Sizes

11. LLAVADI: What Matters For Multimodal Large Language Models Distillation

12. Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight

13. MapLocNet: Coarse-to-Fine Feature Registration for Visual Re-Localization in Navigation Maps

14. BLOS-BEV: Navigation Map Enhanced Lane Segmentation Network, Beyond Line of Sight

15. Learning Spatial-Semantic Features for Robust Video Object Segmentation

16. Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

17. Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification

18. Hybrid Feature Collaborative Reconstruction Network for Few-Shot Fine-Grained Image Classification

19. Domain Generalizable Knowledge Tracing via Concept Aggregation and Relation-Based Attention

20. Multi-level Reliable Guidance for Unpaired Multi-view Clustering

21. Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval

22. Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model

23. PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

24. Crowd-Sourced NeRF: Collecting Data from Production Vehicles for 3D Street View Reconstruction

25. SS-ADA: A Semi-Supervised Active Domain Adaptation Framework for Semantic Segmentation

26. Compressed Sensor Caching and Collaborative Sparse Data Recovery with Anchor Alignment

27. 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation

28. Monocular Localization with Semantics Map for Autonomous Vehicles

29. DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection

30. HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios

31. SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow

32. Sharing Key Semantics in Transformer Makes Efficient Image Restoration

33. Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model

34. Explainable Molecular Property Prediction: Aligning Chemical Concepts with Predictions via Language Models

35. Efficient Visual State Space Model for Image Deblurring

36. AMFD: Distillation via Adaptive Multimodal Fusion for Multispectral Pedestrian Detection

37. Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance

38. Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance

39. Unpaired Multi-view Clustering via Reliable View Guidance

40. SHE-Net: Syntax-Hierarchy-Enhanced Text-Video Retrieval

41. Physics-Informed Neural Networks and Beyond: Enforcing Physical Constraints in Quantum Dissipative Dynamics

42. Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring

43. AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters

44. Taming Latent Diffusion Model for Neural Radiance Field Inpainting

45. No More Ambiguity in 360° Room Layout via Bi-Layout Estimation

46. Gaga: Group Any Gaussians via 3D-aware Memory Bank

47. Tianyu: search for the second solar system and explore the dynamic universe

48. Spatial-Temporal Multi-level Association for Video Object Segmentation

49. Mansformer: Efficient Transformer of Mixed Attention for Image Deblurring and Beyond

50. HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras
