Search

Your search keyword '"Kira, Zsolt"' showing total 320 results

Search Constraints

Start Over You searched for: Author "Kira, Zsolt" Remove constraint Author: "Kira, Zsolt" Publication Year Range Last 50 years Remove constraint Publication Year Range: Last 50 years
320 results on '"Kira, Zsolt"'

Search Results

1. From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons

2. Grounding Descriptions in Images informs Zero-Shot Visual Recognition

3. Adversarial Attacks Using Differentiable Rendering: A Survey

4. Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models

5. Neural Fields in Robotics: A Survey

6. ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI

7. Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge

8. Reinforcement Learning via Auxiliary Task Distillation

9. ICE-G: Image Conditional Editing of 3D Gaussian Splats

10. Grounding Multimodal Large Language Models in Actions

11. Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control

12. Adaptive Memory Replay for Continual Learning

13. GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation

14. NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

15. N-QR: Natural Quick Response Codes for Multi-Robot Instance Correspondence

16. Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

17. Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters

18. DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets

19. Fast Trainable Projection for Robust Fine-Tuning

20. Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots

21. FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects

22. Memory in Plain Sight: Surveying the Uncanny Resemblances of Associative Memories and Diffusion Models

23. LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration

24. NeO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes

25. Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion

26. HomeRobot: Open-Vocabulary Mobile Manipulation

27. Continual Adaptation of Vision Transformers for Federated Learning

28. Adaptive Coordination in Social Embodied Rearrangement

29. HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning

30. Training Energy-Based Normalizing Flow with Score-Matching Objectives

31. CLIP-GCD: Simple Language Guided Generalized Category Discovery

32. We Need to Talk: Identifying and Overcoming Communication-Critical Scenarios for Self-Driving

33. Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

34. Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA

35. BC-IRL: Learning Generalizable Reward Functions from Demonstrations

36. Trainable Projected Gradient Method for Robust Fine-tuning

37. OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav

38. Communication-Critical Planning via Multi-Agent Trajectory Exchange

39. System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games

40. CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning

41. Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation

42. ConStruct-VL: Data-Free Continual Structured VL Concepts Learning

43. Polyhistor: Parameter-Efficient Multi-Task Adaptation for Dense Vision Tasks

44. FedFOR: Stateless Heterogeneous Federated Learning with First-Order Regularization

45. On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition

46. Open-Set Semi-Supervised Object Detection

47. ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization

48. Unbiased Teacher v2: Semi-supervised Object Detection for Anchor-free and Anchor-based Detectors

49. Lifelong Wandering: A realistic few-shot online continual learning setting

50. Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

Catalog

Books, media, physical & digital resources