Search

Your search for "Zhou, Jie" returned 288 results.

Search Constraints

You searched for: Author "Zhou, Jie"; Topic "computer science - computer vision and pattern recognition"

Search Results

1. XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation

2. V2M: Visual 2-Dimensional Mamba for Image Representation Learning

3. GlobalMamba: Global Image Serialization for Vision Mamba

4. SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation

5. Q-VLM: Post-training Quantization for Large Vision-Language Models

6. OPONeRF: One-Point-One NeRF for Robust Neural Rendering

7. MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation

8. FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner

9. AVG-LLaVA: A Large Multimodal Model with Adaptive Visual Granularity

10. POINTS: Improving Your Vision-language Model with Affordable Strategies

11. DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation

12. EmbodiedSAM: Online Segment Any 3D Thing in Real Time

13. Scene-wise Adaptive Network for Dynamic Cold-start Scenes Optimization in CTR Prediction

14. MiniCPM-V: A GPT-4V Level MLLM on Your Phone

15. UniTTA: Unified Benchmark and Versatile Framework Towards Realistic Test-Time Adaptation

16. Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task

17. Camera-LiDAR Cross-modality Gait Recognition

18. Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

19. Learning 1D Causal Visual Representation with De-focus Attention Networks

20. Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion

21. FlowIE: Efficient Image Enhancement via Rectified Flow

22. GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

23. NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation

24. Rethinking Overlooked Aspects in Vision-Language Models

25. Joint Identity Verification and Pose Alignment for Partial Fingerprints

26. Latent Fingerprint Matching via Dense Minutia Descriptor

27. Sports Analysis and VR Viewing System Based on Player Tracking and Pose Estimation with Multimodal and Multiview Sensors

28. Regression of Dense Distortion Field from a Single Fingerprint Image

29. Phase-aggregated Dual-branch Network for Efficient Fingerprint Dense Registration

30. Pose-Specific 3D Fingerprint Unfolding

31. Direct Regression of Distortion Field from a Single Fingerprint Image

32. NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

33. CodeEnhance: A Codebook-Driven Approach for Low-Light Image Enhancement

34. LOGO: A Long-Form Video Dataset for Group Action Quality Assessment

35. DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery

36. Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

37. RCdpia: A Renal Carcinoma Digital Pathology Image Annotation dataset based on pathologists

38. Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution

39. Memory-based Adapters for Online 3D Scene Perception

40. 3D Vascular Segmentation Supervised by 2D Annotation of Maximum Intensity Projection

41. MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

42. Path Choice Matters for Clear Attribution in Path Methods

43. Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

44. Domain Similarity-Perceived Label Assignment for Domain Generalized Underwater Object Detection

45. Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft

46. LiCamPose: Combining Multi-View LiDAR and RGB Cameras for Robust Single-frame 3D Human Pose Estimation

47. HumanReg: Self-supervised Non-rigid Registration of Human Point Cloud

48. LiDAR-based Person Re-identification

49. Fixed-length Dense Descriptor for Efficient Fingerprint Matching

50. SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction
