Back to Search Start Over

TSDTVOS: Target-guided spatiotemporal dual-stream transformers for video object segmentation.

Authors :
Zhou, Wei
Zhao, Yuqian
Zhang, Fan
Luo, Biao
Yu, Lingli
Chen, Baifan
Yang, Chunhua
Gui, Weihua
Source :
Neurocomputing. Oct2023, Vol. 555, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

• A novel Transformer-based framework is proposed for video object segmentation. • Target guidance block is explored for integrating temporal features. • Our method is superior to some existing video object segmentation methods. • Our method can deal with scale variance and distinguish similar objects. Video object segmentation automatically separates the interested objects from the background across a video sequence and was an active research area in recent years. The crucial challenge lies in investigating an effective architecture to fully exploit spatiotemporal correlation in a given video sequence for achieving accurate segmentation results. In this paper, we propose a novel semi-supervised Transformer-based framework called Target-guided Spatiotemporal Dual-stream Transformers (TSDT) with two separate streams to enable effective spatiotemporal context propagation. Technically, the temporal stream is used to aggregate rich temporal cues from past frames, while the spatial stream is trained to encode object location and appearance information stored in the current frame. To compress and integrate temporal features, a target guidance block (TGB) is designed to retrieve target information in the past video flow under the guidance of the current frame. The experimental results on video object segmentation benchmarks demonstrate the feasibility and effectiveness of the proposed framework. Codes and trained models are available at https://github.com/zhouweii234/TSDTVOS. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09252312
Volume :
555
Database :
Academic Search Index
Journal :
Neurocomputing
Publication Type :
Academic Journal
Accession number :
170721228
Full Text :
https://doi.org/10.1016/j.neucom.2023.126582