195 results on '"James M. Rehg"'
Search Results
2. Listen to Look Into the Future: Audio-Visual Egocentric Gaze Anticipation.
3. The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective.
4. LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs.
5. Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations.
6. Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
7. ZeroShape: Regression-Based Zero-Shot Shape Reconstruction.
8. PointInfinity: Resolution-Invariant Point Diffusion Models.
9. RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models.
10. MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding.
11. 3˟ 2: 3D Object Part Segmentation by 2D Semantic Correspondences.
12. ShapeClipper: Scalable 3D Shape Learning from Single-View Images via Geometric and CLIP-Based Consistency.
13. Egocentric Auditory Attention Localization in Conversations.
14. Explaining a machine learning decision to physicians via counterfactuals.
15. Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduction Games.
16. Which way is 'right'?: Uncovering limitations of Vision-and-Language Navigation Models.
17. Transformer-based Localization from Embodied Dialog with Large-scale Pre-training.
18. Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
19. Planes vs. Chairs: Category-Guided 3D Shape Learning Without any 3D Cues.
20. Generative Adversarial Network for Future Hand Segmentation from Egocentric Video.
21. Egocentric Activity Recognition and Localization on a 3D Map.
22. The Surprising Positive Knowledge Transfer in Continual 3D Object Shape Reconstruction.
23. Low-shot Object Learning with Mutual Exclusivity Bias.
24. No RL, No Simulation: Learning to Navigate without Navigating.
25. Discriminative Appearance Modeling With Multi-Track Pooling for Real-Time Multi-Object Tracking.
26. Orthogonal Over-Parameterized Training.
27. Using Shape To Categorize: Low-Shot Learning With an Explicit Shape Bias.
28. Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning.
29. 4D Human Body Capture from Egocentric Video via 3D Scene Grounding.
30. 3D Reconstruction of Novel Object Shapes from Single Images.
31. Where Are You? Localization from Embodied Dialog.
32. Detecting Attended Visual Targets in Video.
33. Regularizing Neural Networks via Minimizing Hyperspherical Energy.
34. Forecasting Human-Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Video.
35. Learning Dense Object Descriptors from Multiple Views for Low-shot Category Generalization.
36. Kernel Multimodal Continuous Attention.
37. PulseImpute: A Novel Benchmark Task for Pulsative Physiological Signal Imputation.
38. Neural Similarity Learning.
39. Taking a Deeper Look at the Inverse Compositional Algorithm.
40. Learning to Generate Synthetic Data via Compositing.
41. Incremental Object Learning From Contiguous Views.
42. Unsupervised 3D Pose Estimation With Geometric Self-Supervision.
43. A Spatiotemporal Approach to Predicting Glaucoma Progression Using a CT-HMM.
44. Locally Weighted Regression Pseudo-Rehearsal for Adaptive Model Predictive Control.
45. Towards Accurate 3D Human Body Reconstruction from Silhouettes.
46. Attention Distillation for Learning Video Representations.
47. Tripping through time: Efficient Localization of Activities in Videos.
48. A Robust Functional EM Algorithm for Incomplete Panel Count Data.
49. Enhancing Cognitive Assessment through Multimodal Sensing: A Case Study Using the Block Design Test.
50. In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.