Search

Your search keyword '"Cornia, A."' showing total 3,015 results

Search Constraints

Start Over You searched for: Author "Cornia, A." Remove constraint Author: "Cornia, A."
3,015 results on '"Cornia, A."'

Search Results

1. Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis

2. Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

3. Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering

4. TPP-Gaze: Modelling Gaze Dynamics in Space and Time with Neural Temporal Point Processes

5. Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments

6. Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training

7. Fluent and Accurate Image Captioning with a Self-Trained Reward Model

8. Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization

9. Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities

10. BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

11. Towards Retrieval-Augmented Architectures for Image Captioning

12. Unlearning Vision Transformers Without Retaining Data via Low-Rank Decompositions

13. Adapt to Scarcity: Few-Shot Deepfake Detection via Low-Rank Adaptation

14. Fluent and Accurate Image Captioning with a Self-trained Reward Model

15. Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

16. Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities

17. BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

18. Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs

19. Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation

20. Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing

21. Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images

22. Trends, Applications, and Challenges in Human Attention Modelling

23. The Revolution of Multimodal Large Language Models: A Survey

24. Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

25. OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data

26. With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning

28. Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training

29. LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On

30. Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing

31. Multi-Class Unlearning for Image Classification via Weight Filtering

32. Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images

33. Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation

34. Embodied Agents for Efficient Exploration and Smart Scene Description

35. Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions

36. The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition

42. ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval

43. Retrieval-Augmented Transformer for Image Captioning

45. Embodied Navigation at the Art Gallery

46. Dress Code: High-Resolution Multi-Category Virtual Try-On

47. Spot the Difference: A Novel Task for Embodied Agents in Changing Environments

48. CaMEL: Mean Teacher Learning for Image Captioning

49. Single-Photon Detectors for Quantum Integrated Photonics

50. Universal Captioner: Inducing Content-Style Separation in Vision-and-Language Model Training

Catalog

Books, media, physical & digital resources