Search

Author search for "Han, Jianhua": 619 results in total

Search Results

1. HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models

2. HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance

3. DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection

4. LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

5. NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

6. From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs

7. Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts

8. PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion

9. G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

10. Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving

12. Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

13. Implicit Concept Removal of Diffusion Models

14. HiLM-D: Towards High-Resolution Understanding in Multimodal Large Language Models for Autonomous Driving

15. Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images

16. GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training

17. DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability

19. CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation

20. Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards

21. DetGPT: Detect What You Need via Reasoning

22. DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment

23. CLIP$^2$: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data

24. Towards Universal Vision-language Omni-supervised Segmentation

25. CapDet: Unifying Dense Captioning and Open-World Detection Pretraining

26. Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving

28. NLIP: Noise-robust Language-Image Pre-training

29. Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection

30. Generative Negative Text Replay for Continual Vision-Language Pretraining

31. DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection

32. Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving

33. Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding

34. Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing

35. ONCE-3DLanes: Building Monocular 3D Lane Detection

36. Laneformer: Object-aware Row-Column Transformers for Lane Detection

37. CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving

40. GAP43-dependent mitochondria transfer from astrocytes enhances glioblastoma tumorigenicity

41. SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving

42. Structural Innovation and Theoretical Optimization of Modern Cultural Industry Tax Administration Based on Lasso Regression Algorithm

46. Dissecting the structure-stability relationship of Y-series electron acceptors for real-world solar cell applications
