Search

Your search keyword '"Visual Question Answering"' showing total 677 results

Search Constraints

Start Over You searched for: Descriptor "Visual Question Answering" Remove constraint Descriptor: "Visual Question Answering"
677 results on '"Visual Question Answering"'

Search Results

1. Towards Open-Ended Visual Quality Comparison

2. WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering

3. Overview of the Trauma THOMPSON Challenge at MICCAI 2023

4. The Trauma THOMPSON Challenge Report MICCAI 2023

5. Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge

6. Multi-stage reasoning on introspecting and revising bias for visual question answering.

7. Learning to enhance areal video captioning with visual question answering.

8. DCF–VQA: Counterfactual Structure Based on Multi–Feature Enhancement

9. Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering

10. Integrating IoT and visual question answering in smart cities: Enhancing educational outcomes

11. Vision transformer-based visual language understanding of the construction process

12. Multimodal attention-driven visual question answering for Malayalam.

13. HRVQA: A Visual Question Answering benchmark for high-resolution aerial images.

14. ViCLEVR: a visual reasoning dataset and hybrid multimodal fusion model for visual question answering in Vietnamese.

15. Sign-based image criteria for social interaction visual question answering.

16. Vision transformer-based visual language understanding of the construction process.

17. Advancing surgical VQA with scene graph knowledge.

18. DCF-VQA: COUNTERFACTUAL STRUCTURE BASED ON MULTI--FEATURE ENHANCEMENT.

19. Learning a Mixture of Conditional Gating Blocks for Visual Question Answering.

20. Enhancing machine vision: the impact of a novel innovative technology on video question-answering.

21. TRANS-VQA: Fully Transformer-Based Image Question-Answering Model Using Question-guided Vision Attention.

22. EarthVQANet: Multi-task visual question answering for remote sensing image understanding.

23. Knowledge-aware image understanding with multi-level visual representation enhancement for visual question answering.

24. A focus fusion attention mechanism integrated with image captions for knowledge graph-based visual question answering.

25. Design as Desired: Utilizing Visual Question Answering for Multimodal Pre-training

26. Region-Specific Retrieval Augmentation for Longitudinal Visual Question Answering: A Mix-and-Match Paradigm

27. Can LLMs’ Tuning Methods Work in Medical Multimodal Domain?

28. Overview of the ImageCLEF 2024: Multimedia Retrieval in Medical Applications

29. CHIC: Corporate Document for Visual Question Answering

30. CircuitVQA: A Visual Question Answering Dataset for Electrical Circuit Images

31. VQA-PDF: Purifying Debiased Features for Robust Visual Question Answering Task

32. Image Understanding Through Visual Question Answering: A Review from Past Research

33. IIU: Independent Inference Units for Knowledge-Based Visual Question Answering

34. Experiential Questioning for VQA

36. Generating Type-Related Instances and Metric Learning to Overcoming Language Priors in VQA

37. GViG: Generative Visual Grounding Using Prompt-Based Language Modeling for Visual Question Answering

38. A Balanced Counting Visual Question Answering Dataset

39. Evaluation of Systematic Errors in Visual Question Answering

40. Advancing Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications with ImageCLEF 2024

41. Cross-Modal Retrieval for Knowledge-Based Visual Question Answering

42. Can Machines and Humans Use Negation When Describing Images?

43. Weakly-Supervised Grounding for VQA with Dual Visual-Linguistic Interaction

44. Visual Question Answering – VizWiz Challenge

45. VCD: Visual Causality Discovery for Cross-Modal Question Reasoning

46. Enhancing Image Comprehension for Computer Science Visual Question Answering

47. Syntax Tree Constrained Graph Network for Visual Question Answering

48. Dual modality prompt learning for visual question-grounded answering in robotic surgery

49. Graph neural networks for visual question answering: a systematic review.

50. Learning the Meanings of Function Words From Grounded Language Using a Visual Question Answering Model.

Catalog

Books, media, physical & digital resources