1. Semantic Relation Model and Dataset for Remote Sensing Scene Understanding
- Author
-
Aziguli Wulamu, Xin Liu, Peng Li, Peng Chen, and Dezheng Zhang
- Subjects
Computer science ,media_common.quotation_subject ,Geography, Planning and Development ,0211 other engineering and technologies ,02 engineering and technology ,Field (computer science) ,semantic relation cognition ,Perception ,0202 electrical engineering, electronic engineering, information engineering ,Earth and Planetary Sciences (miscellaneous) ,remote sensing scene understanding ,Scene graph ,Computers in Earth Sciences ,021101 geological & geomatics engineering ,media_common ,Remote sensing ,Geography (General) ,scene graph generation ,attentional mechanism ,Cognition ,dilated convolution ,Relationship extraction ,multi-scale semantic fusion ,Remote sensing (archaeology) ,graph convolutional network ,Graph (abstract data type) ,G1-922 ,020201 artificial intelligence & image processing ,Semantic gap - Abstract
A deep understanding of our visual world is more than an isolated perception on a series of objects, and the relationships between them also contain rich semantic information. Especially for those satellite remote sensing images, the span is so large that the various objects are always of different sizes and complex spatial compositions. Therefore, the recognition of semantic relations is conducive to strengthen the understanding of remote sensing scenes. In this paper, we propose a novel multi-scale semantic fusion network (MSFN). In this framework, dilated convolution is introduced into a graph convolutional network (GCN) based on an attentional mechanism to fuse and refine multi-scale semantic context, which is crucial to strengthen the cognitive ability of our model Besides, based on the mapping between visual features and semantic embeddings, we design a sparse relationship extraction module to remove meaningless connections among entities and improve the efficiency of scene graph generation. Meanwhile, to further promote the research of scene understanding in remote sensing field, this paper also proposes a remote sensing scene graph dataset (RSSGD). We carry out extensive experiments and the results show that our model significantly outperforms previous methods on scene graph generation. In addition, RSSGD effectively bridges the huge semantic gap between low-level perception and high-level cognition of remote sensing images.
- Published
- 2021