Back to Search Start Over

GLOCAL: A self-supervised learning framework for global and local motion estimation.

Authors :
Zheng, Yihao
Luo, Kunming
Liu, Shuaicheng
Li, Zun
Xiang, Ye
Wu, Lifang
Zeng, Bing
Chen, Chang Wen
Source :
Pattern Recognition Letters. Feb2024, Vol. 178, p91-97. 7p.
Publication Year :
2024

Abstract

Motions in videos are typically a mixture of local dynamic object motions and global camera motion, which are inconsistent in some cases, and even interfere with each other, causing difficulties in various downstream applications, such as video stabilization that requires the global motion, and action recognition that consumes local motions. Therefore, it is crucial to estimate them separately. Existing methods separate two motions from the mixed motion fields, such as optical flow. However, the quality of mixed motion determines the higher bounds of the performance. In this work, we propose a framework, GLOCAL, to directly estimate global and local motions simultaneously from adjacent frames in a self-supervised manner. Our GLOCAL consists of a Global Motion Estimation (GME) module and a Local Motion Estimation (LME) module. The GME module involves a mixed motion estimation backbone, an implicit bottleneck structure for feature dimension reduction, and an explicit bottleneck for global motion recovery based on the global motion bases with foreground mask under the training guidance of proposed global reconstruction loss. An attention U-Net is adopted for LME which produces local motions while excluding motion of irrelevant regions under the guidance of proposed local reconstruction loss. Our method can achieve better performance than the existing methods on the homography estimation dataset DHE and the action recognition dataset NCAA and UCF-101. • A unified framework to estimate global and local motion simultaneously. • An implicit–explicit bottleneck for global motion without irrelevant information. • Two loss functions to learn global and local motion in a self-supervised manner. • Achieves SOTA in regular homography estimation scenes and action recognition task. [Display omitted] [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01678655
Volume :
178
Database :
Academic Search Index
Journal :
Pattern Recognition Letters
Publication Type :
Academic Journal
Accession number :
175240624
Full Text :
https://doi.org/10.1016/j.patrec.2023.12.024