51. Designing Efficient Neural Attention Systems Towards Achieving Human-level Sharp Vision
- Author
-
Abdul, Ghani Abdul Rahman, Koganti, Nishanth, Solano, Alfredo, Iwasawa, Yusuke, Nakayama, Kotaro, Matsuo, Yutaka, Abdul, Ghani Abdul Rahman, Koganti, Nishanth, Solano, Alfredo, Iwasawa, Yusuke, Nakayama, Kotaro, and Matsuo, Yutaka
- Abstract
Human vision is capable of focusing on subtle visual cues at high resolution by relying on a foveal view coupled with an attention mechanism. Recently, there have been several studies that proposed deep reinforcement learning based attention models. However, these studies do not explicitly consider the design of a foveal representation and its effect on an attention system is unclear. In this paper, we investigate the effect of using a hierarchy of visual streams in training an efficient attention model towards achieving a human-level sharp vision. We perform our evaluation on a simulated human-robot interaction task where the agent attends to faces that are looking at it. The experimental results show that the performance of the system relies on factors such as the number of visual streams, their relative field-of-view and we demonstrate that maintaining a hierarchy within the visual streams is crucial to learn attention strategies., Workshop on Sixth International Conference on Learning Representations(ICLR 2018), May 3, 2018, Vancouver Canada
- Published
- 2023