Search

Your search keyword '"Cherian, Anoop"' showing total 253 results

Search Constraints

Start Over You searched for: Author "Cherian, Anoop" Remove constraint Author: "Cherian, Anoop"
253 results on '"Cherian, Anoop"'

Search Results

1. Temporally Grounding Instructional Diagrams in Unconstrained Videos

2. Disentangled Acoustic Fields For Multimodal Physical Scene Understanding

3. Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads

4. TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

5. Multi-level Reasoning for Robotic Assembly: From Sequence Inference to Contact Selection

6. Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

7. Pixel-Grounded Prototypical Part Networks

8. CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments

9. HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions

10. Aligning Step-by-Step Instructional Diagrams to Video Demonstrations

11. Are Deep Neural Networks SMARTer than Second Graders?

12. Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation

13. H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions

14. AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments

15. (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering

16. Max-Margin Contrastive Learning

17. MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation

18. Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning

19. A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction

20. Visual Scene Graphs for Audio Source Separation

21. InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images

22. Generalized One-Class Learning Using Pairs of Complementary Classifiers

23. Learning Log-Determinant Divergences for Positive Definite Matrices

24. Tensor Representations for Action Recognition

25. First-Order Optimization Inspired from Finite-Time Convergent Flows

26. Sound2Sight: Generating Visual Dynamics from Sound and Context

27. Representation Learning via Adversarially-Contrastive Optimal Transport

28. Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers

29. Dense Non-Rigid Structure from Motion: A Manifold Viewpoint

30. Inferring Temporal Compositions of Actions Using Probabilistic Automata

31. LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood

32. Spatio-Temporal Ranked-Attention Networks for Video Captioning

33. The Eighth Dialog System Technology Challenge

34. Discriminative Video Representation Learning Using Support Vector Classifiers

35. GODS: Generalized One-class Discriminative Subspaces for Anomaly Detection

36. Game Theoretic Optimization via Gradient-based Nikaido-Isoda Function

37. Audio-Visual Scene-Aware Dialog

40. Contrastive Video Representation Learning via Adversarial Perturbations

41. Sem-GAN: Semantically-Consistent Image-to-Image Translation

42. End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features

43. Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7

44. Non-Linear Temporal Subspace Representations for Activity Recognition

45. Video Representation Learning Using Discriminative Pooling

46. Scalable Dense Non-rigid Structure-from-Motion: A Grassmannian Perspective

47. Neural Algebra of Classifiers

48. Human Action Forecasting by Learning Task Grammars

49. Learning Discriminative Alpha-Beta-divergence for Positive Definite Matrices (Extended Version)

50. Human Pose Forecasting via Deep Markov Models

Catalog

Books, media, physical & digital resources