Search

Your search keyword '"Zhang, Wenwei"' showing total 1,202 results

Search Constraints

Start Over You searched for: Author "Zhang, Wenwei" Remove constraint Author: "Zhang, Wenwei"
1,202 results on '"Zhang, Wenwei"'

Search Results

1. LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

2. SLAM assisted 3D tracking system for laparoscopic surgery

3. MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

4. CIBench: Evaluating Your LLMs with a Code Interpreter Plugin

5. 4D Contrastive Superflows are Dense 3D Representation Learners

6. ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models

7. InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

8. ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities

9. InternLM-Law: An Open Source Chinese Legal Large Language Model

10. F-LMM: Grounding Frozen Large Multimodal Models

11. ANAH: Analytical Annotation of Hallucinations in Large Language Models

12. AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data

13. Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving

14. An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models

15. MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark

16. The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

17. Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving

18. InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

19. InternLM2 Technical Report

20. Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding

21. Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

22. CriticEval: Evaluating Large Language Model as Critic

23. Code Needs Comments: Enhancing Code LLMs with Comment Augmentation

24. InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

25. Global marine microbial diversity and its potential in bioprospecting

27. InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

28. Can AI Assistants Know What They Don't Know?

29. OMG-Seg: Is One Model Good Enough For All Segmentation?

30. EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

31. T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step

32. CLIM: Contrastive Language-Image Mosaic for Region Representation

33. Mixed Pseudo Labels for Semi-Supervised Object Detection

34. Fake Alignment: Are LLMs Really Aligned Well?

35. 4D Contrastive Superflows are Dense 3D Representation Learners

39. OV-PARTS: Towards Open-Vocabulary Part Segmentation

40. Evaluating Hallucinations in Chinese Large Language Models

41. CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

42. DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection

43. InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition

44. Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection

45. Unified Human-Scene Interaction via Prompted Chain-of-Contacts

46. GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

47. Segment Any Point Cloud Sequences by Distilling Vision Foundation Models

48. MultiModal-GPT: A Vision and Language Model for Dialogue with Humans

Catalog

Books, media, physical & digital resources