Back to Search
Start Over
DASTSiam: Spatio‐temporal fusion and discriminative enhancement for Siamese visual tracking
- Source :
- IET Computer Vision, Vol 17, Iss 8, Pp 1017-1033 (2023)
- Publication Year :
- 2023
- Publisher :
- Wiley, 2023.
-
Abstract
- Abstract The use of deep neural networks has revolutionised object tracking tasks, and Siamese trackers have emerged as a prominent technique for this purpose. Existing Siamese trackers use a fixed template or template updating technique, but it is prone to overfitting, lacks the capacity to exploit global temporal sequences, and cannot utilise multi‐layer features. As a result, it is challenging to deal with dramatic appearance changes in complicated scenarios. Siamese trackers also struggle to learn background information, which impairs their discriminative ability. Hence, two transformer‐based modules, the Spatio‐Temporal Fusion (ST) module and the Discriminative Enhancement (DE) module, are proposed to improve the performance of Siamese trackers. The ST module leverages cross‐attention to accumulate global temporal cues and generates an attention matrix with ST similarity to enhance the template's adaptability to changes in target appearance. The DE module associates semantically similar points from the template and search area, thereby generating a learnable discriminative mask to enhance the discriminative ability of the Siamese trackers. In addition, a Multi‐Layer ST module (ST + ML) was constructed, which can be integrated into Siamese trackers based on multi‐layer cross‐correlation for further improvement. The authors evaluate the proposed modules on four public datasets and show comparative performance compared to existing Siamese trackers.
Details
- Language :
- English
- ISSN :
- 17519640 and 17519632
- Volume :
- 17
- Issue :
- 8
- Database :
- Directory of Open Access Journals
- Journal :
- IET Computer Vision
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.50a43be7949d4f63968a8e9a6c1f827d
- Document Type :
- article
- Full Text :
- https://doi.org/10.1049/cvi2.12213