Search

Your search keyword '"Essa, Irfan"' showing total 500 results

Search Constraints

Start Over You searched for: Author "Essa, Irfan" Remove constraint Author: "Essa, Irfan"
500 results on '"Essa, Irfan"'

Search Results

1. MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation

2. Calibrated Multi-Preference Optimization for Aligning Diffusion Models

3. Learning Complex Non-Rigid Image Edits from Multimodal Conditioning

4. AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset

5. Exploring Efficient Foundational Multi-modal Models for Video Summarization

6. Mamba Fusion: Learning Actions Through Questioning

7. Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition -- And Ways to Overcome Them

8. Cropper: Vision-Language Model for Image Cropping through In-Context Learning

9. Parrot: Pareto-Optimal Multi-reward Reinforcement Learning Framework for Text-to-Image Generation

10. CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers

11. SLAIM: Robust Dense Neural SLAM for Online Tracking and Mapping

12. 3D Semantic MapNet: Building Maps for Multi-Object Re-Identification in 3D

13. On the Efficacy of Text-Based Input Modalities for Action Anticipation

14. Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

15. VideoPoet: A Large Language Model for Zero-Shot Video Generation

16. Photorealistic Video Generation with Diffusion Models

17. BayRnTune: Adaptive Bayesian Domain Randomization via Strategic Fine-tuning

18. Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

19. Words into Action: Learning Diverse Humanoid Robot Behaviors using Language Guided Iterative Motion Refinement

20. Automatic Multi-Path Web Story Creation from a Structural Article

21. Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition

22. SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs

23. Learning Disentangled Prompts for Compositional Image Synthesis

24. Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition

25. StyleDrop: Text-to-Image Generation in Any Style

26. Prompt-Free Diffusion: Taking 'Text' out of Text-to-Image Diffusion Models

27. Tackling Hate Speech in Low-resource Languages with Context Experts

28. MaskSketch: Unpaired Structure-guided Masked Image Generation

29. Emergence of Maps in the Memories of Blind Navigation Agents

30. Cascaded Compositional Residual Learning for Complex Interactive Behaviors

31. MAGVIT: Masked Generative Video Transformer

32. Investigating Enhancements to Contrastive Predictive Coding for Human Activity Recognition

33. Multi-Stage Based Feature Fusion of Multi-Modal Data for Human Activity Recognition

34. End-to-End Multimodal Representation Learning for Video Dialog

35. Video based Object 6D Pose Estimation using Transformers

36. Finding Islands of Predictability in Action Forecasting

37. VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement

38. Visual Prompt Tuning for Generative Transfer Learning

39. Improved Masked Image Generation with Token-Critic

40. Assessing the State of Self-Supervised Human Activity Recognition using Wearables

41. Learning Temporal Rules from Noisy Timeseries Data

42. BLT: Bidirectional Layout Transformer for Controllable Layout Generation

43. VideoPose: Estimating 6D object pose from videos

44. Discrete Representations Strengthen Vision Transformer Robustness

45. Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning

46. Unsupervised Discovery of Actions in Instructional Videos

47. Unsupervised Action Segmentation for Instructional Videos

48. Automatic Non-Linear Video Editing Transfer

49. PLAN-B: Predicting Likely Alternative Next Best Sequences for Action Prediction

50. How to Train PointGoal Navigation Agents on a (Sample and Compute) Budget

Catalog

Books, media, physical & digital resources