Search

Search for author "Zhao, Bo": 15,322 results

Search Results

5. Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?

6. Emu3: Next-Token Prediction is All You Need

7. Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding

8. Automated design of nonreciprocal thermal emitters via Bayesian optimization

9. Enhancing Long Video Understanding via Hierarchical Event-Based Memory

10. TC-LLaVA: Rethinking the Transfer from Image to Video Understanding with Temporal Considerations

11. 52B to 1T: Lessons Learned via Tele-FLM Series

12. PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

13. 2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation

14. SpatialBot: Precise Spatial Understanding with Vision Language Models

15. Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions

16. Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking

17. VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval

18. MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding

19. Filamentary Hierarchies and Superbubbles: Galactic Multiscale MHD Simulations of GMC to Star Cluster Formation

20. The SkatingVerse Workshop & Challenge: Methods and Results

21. VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

22. Efficient Multimodal Large Language Models: A Survey

23. Eliminating nearfield coupling in dense high quality factor phase gradient metasurfaces

24. Large Language Model-aided Edge Learning in Distribution System State Estimation

25. Understanding the Difficulty of Solving Cauchy Problems with PINNs

26. FlexiFilm: Long Video Generation with Flexible Conditions

27. Tele-FLM Technical Report

28. Advances and Open Challenges in Federated Foundation Models

29. Stable Acceleration of a LHe-Free Nb3Sn demo SRF e-linac Based on Conduction Cooling

30. M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models

31. Efficient size-prescribed $k$-core search

39. SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model

40. Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability

41. Efficient Multimodal Learning from Data-centric Perspective

42. RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model

43. Spin: An Efficient Secure Computation Framework with GPU Acceleration

44. Distributional Counterfactual Explanations With Optimal Transport

45. Learning Position-Aware Implicit Neural Network for Real-World Face Inpainting

46. Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor Collections

47. Open-DDVM: A Reproduction and Extension of Diffusion Model for Optical Flow Estimation
