Back to Search
Start Over
Lightweight and Efficient Multimodal Prompt Injection Network for Scene Parsing of Remote Sensing Scene Images
- Source :
- IEEE Transactions on Geoscience and Remote Sensing; 2024, Vol. 62 Issue: 1 p1-9, 9p
- Publication Year :
- 2024
-
Abstract
- Scene parsing of high-resolution remote sensing images with complex backgrounds has received extensive attention in recent years. As unimodal networks are significantly affected by weather conditions, reflecting complex ground conditions fully and accurately is difficult; therefore, multimodal scene analysis is particularly important. Current multimodal scene-parsing networks often employ a dual-coding architecture to achieve high-performance segmentation. Because prompt learning allows models to understand and capture contextual information more effectively, the proposed prompt injection module (PIM) extracts relevant information from frozen normalized digital surface model (nDSM) features and integrates it into the infrared, red, and green (IRRG) branches through a modal embedding block. To extract the contextual semantic relationships between the local and global features in the image efficiently, we also design a dynamic filter block for feature enhancement. This design facilitates the mutual complementarity and guidance of information between the two modalities and optimizes fusion. The experimental results demonstrate that lightweight and effective multimodal prompt injection network (LENet) outperforms most current state-of-the-art lightweight methods on two public datasets, achieving comparable accuracy to that of traditional methods. It has only 10.81 M parameters, with 2.72 GFLOPS. Our code and results are available at <uri>https://github.com/LYZ00918/LENet</uri>.
Details
- Language :
- English
- ISSN :
- 01962892 and 15580644
- Volume :
- 62
- Issue :
- 1
- Database :
- Supplemental Index
- Journal :
- IEEE Transactions on Geoscience and Remote Sensing
- Publication Type :
- Periodical
- Accession number :
- ejs68305101
- Full Text :
- https://doi.org/10.1109/TGRS.2024.3507784