1. Vector Semiotic Model for Visual Question Answering
- Author
-
Aleksandr I. Panov, Alexey K. Kovalev, Evgeny Osipov, and Makhmud Shaban
- Subjects
Structure (mathematical logic) ,Computer science ,business.industry ,Cognitive Neuroscience ,Sign (semiotics) ,Experimental and Cognitive Psychology ,Context (language use) ,Symbol grounding ,Artificial Intelligence ,Embodied cognition ,Encoding (memory) ,Question answering ,Semiotics ,Artificial intelligence ,business ,Software - Abstract
In this paper, we propose a Vector Semiotic Model as a possible solution to the symbol grounding problem in the context of Visual Question Answering. The Vector Semiotic Model combines the advantages of a Semiotic Approach implemented in the Sign-Based World Model and Vector Symbolic Architectures. The Sign-Based World Model represents information about a scene depicted on an input image in a structured way and grounds abstract objects in an agent’s sensory input. We use the Vector Symbolic Architecture to represent the elements of the Sign-Based World Model on a computational level. Properties of a high-dimensional space and operations defined for high-dimensional vectors allow encoding the whole scene into a high-dimensional vector with the preservation of the structure. That leads to the ability to apply explainable reasoning to answer an input question. We conducted experiments are on a CLEVR dataset and show results comparable to the state of the art. The proposed combination of approaches, first, leads to the possible solution of the symbol-grounding problem and, second, allows expanding current results to other intelligent tasks (collaborative robotics, embodied intellectual assistance, etc.).
- Published
- 2022