Search

Your search keyword '"Mangalam, Karttikeya"' showing total 101 results

Search Constraints

Start Over You searched for: Author "Mangalam, Karttikeya" Remove constraint Author: "Mangalam, Karttikeya"
101 results on '"Mangalam, Karttikeya"'

Search Results

1. Adaptive Human Trajectory Prediction via Latent Corridors

2. LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

3. xT: Nested Tokenization for Larger Context in Large Images

4. Do Vision and Language Encoders Represent the World Similarly?

5. Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning

6. Adaptive Human Trajectory Prediction via Latent Corridors

7. Sequential Modeling Enables Scalable Learning for Large Vision Models

8. EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding

9. PaReprop: Fast Parallelized Reversible Backpropagation

10. Diffusion Models as Masked Autoencoders

11. Speculative Decoding with Big Little Decoder

12. Reversible Vision Transformers

13. Re-evaluating the Need for Multimodal Signals in Unsupervised Grammar Induction

14. Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization

15. Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022

16. Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens

17. Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

18. MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition

19. Overcoming Mode Collapse with Adaptive Multi Adversarial Training

20. MViTv2: Improved Multiscale Vision Transformers for Classification and Detection

21. Ego4D: Around the World in 3,000 Hours of Egocentric Video

22. Object-Region Video Transformers

23. LOKI: Long Term and Key Intentions for Trajectory Prediction

24. Multiscale Vision Transformers

25. From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting

26. Long-term Human Motion Prediction with Scene Context

27. It Is Not the Journey but the Destination: Endpoint Conditioned Trajectory Prediction

28. Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision

29. On Compressing U-net Using Knowledge Distillation

30. Learning Spontaneity to Improve Emotion Recognition In Speech

31. Future Person Localization in First-Person Videos

32. Bitwise Operations of Cellular Automaton on Gray-scale Images

33. Perceiving People over Long Periods: Algorithms, Architectures & Datasets

38. A Vision-free Baseline for Multimodal Grammar Induction

39. Big Little Transformer Decoder

40. Object-Region Video Transformers

42. Ego4D: Around the World in 3,000 Hours of Egocentric Video

44. Reversible Vision Transformers

45. Does unsupervised grammar induction need pixels?

47. Multiscale Vision Transformers

Catalog

Books, media, physical & digital resources