Back to Search Start Over

ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation

Authors :
Li, Jiahao
Xu, Yuanyou
Yang, Zongxin
Yang, Yi
Zhuang, Yueting
Publication Year :
2023

Abstract

The Associating Objects with Transformers (AOT) framework has exhibited exceptional performance in a wide range of complex scenarios for video object segmentation. In this study, we introduce MSDeAOT, a variant of the AOT series that incorporates transformers at multiple feature scales. Leveraging the hierarchical Gated Propagation Module (GPM), MSDeAOT efficiently propagates object masks from previous frames to the current frame using a feature scale with a stride of 16. Additionally, we employ GPM in a more refined feature scale with a stride of 8, leading to improved accuracy in detecting and tracking small objects. Through the implementation of test-time augmentations and model ensemble techniques, we achieve the top-ranking position in the EPIC-KITCHEN VISOR Semi-supervised Video Object Segmentation Challenge.<br />Comment: Top 1 solution for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2307.02010
Document Type :
Working Paper