21 results for "James M. Rehg"
Search Results
2. Listen to Look Into the Future: Audio-Visual Egocentric Gaze Anticipation.
3. The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective.
4. LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs.
5. Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations.
6. Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
7. ZeroShape: Regression-Based Zero-Shot Shape Reconstruction.
8. PointInfinity: Resolution-Invariant Point Diffusion Models.
9. RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models.
10. MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding.
11. 3x2: 3D Object Part Segmentation by 2D Semantic Correspondences.
12. In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation and Beyond.
13. Human Action Anticipation: A Survey.
14. Towards Social AI: A Survey on Understanding Social Interactions.
15. Leveraging Object Priors for Point Tracking.
16. 3x2: 3D Object Part Segmentation by 2D Semantic Correspondences.
17. What is the Visual Cognition Gap between Humans and Multimodal LLMs?
18. MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs.
19. Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation.
20. PointInfinity: Resolution-Invariant Point Diffusion Models.
21. Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations.
Discovery Service for Jio Institute Digital Library