Back to Search
Start Over
RGBT Tracking by Trident Fusion Network
- Source :
- IEEE Transactions on Circuits and Systems for Video Technology. 32:579-592
- Publication Year :
- 2022
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2022.
-
Abstract
- In recent years, RGBT tracking has become a hot topic in the field of visual tracking, and made great progress. In this paper, we propose a novel Trident Fusion Network (TFNet) to achieve effective fusion of different modalities for robust RGBT tracking. In specific, to deploy the complementarity of features of all convolutional layers, we propose a recursive strategy to densely aggregate these features that yield robust representations of target objects in two modalities. Moreover, we design a trident architecture to integrate the fused features and both modality-specific features for robust target representations. There are three main advantages. First, retaining the classification layer of each modality is beneficial to enhance feature learning of single modality, and compared with aggregate branches, single-modality branches pay more attention to the mining of modal specific information. Second, when some modality is noisy or invalid, the modality-specific branches would capture more discriminative features for RGBT tracking. Finally, the integration of aggregation branches and single-modality branches is beneficial to the complementary learning of different modalities. In addition, we also introduce a feature pruning module in each branch to prune the redundant features and avoid network overfitting. Experimental results on four RGBT tracking benchmark datasets suggest that our tracker achieves superior performance against the state-of-the-art RGBT tracking methods.
- Subjects :
- Modality (human–computer interaction)
Computer science
business.industry
Pattern recognition
Overfitting
Discriminative model
Feature (computer vision)
Media Technology
Benchmark (computing)
Eye tracking
Pruning (decision trees)
Artificial intelligence
Electrical and Electronic Engineering
business
Feature learning
Subjects
Details
- ISSN :
- 15582205 and 10518215
- Volume :
- 32
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Circuits and Systems for Video Technology
- Accession number :
- edsair.doi...........1acb38335d96d402493a275d617a2e87