Back to Search Start Over

SOFT: Self-supervised sparse Optical Flow Transformer for video stabilization via quaternion.

Authors :
Wang, Naiyao
Zhou, Changdong
Zhu, Rongfeng
Zhang, Bo
Wang, Ye
Liu, Hongbo
Source :
Engineering Applications of Artificial Intelligence. Apr2024, Vol. 130, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Video stabilization is crucial for video representation learning, which suffers from the challenges such as the perception of unstable vision, the stripping and cognition of target motion features in complex scenes, the correction of the jittery camera systems trails. In this paper, we propose a Self-supervised sparse Optical Flow Transformer (SOFT) model, consisting of a self-supervised contrastive learning transformer network, a sparse optical flow perception network and a multimodal cognitive fusion network. The SOFT model takes advantage of optical flow to estimate motion. The sparse optical flow perception network perceiving partially sparse optical flow containing motion features. This serves as the input to the self-supervised contrastive learning transformer network for generating sparse optical flow features, which are fed into the multimodal cognitive fusion network together with the real and virtual camera pose for video frame warping. Experimental comparisons with state-of-the-art models on 4 metrics demonstrate the effectiveness of the SOFT model. It achieves the best performance with an average Stability of 0.869 and average Distortion of 0.993 across 6 categories videos, which shows that the SOFT model can effectively perceive the motion in the video and smooth the jitter track of videos. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09521976
Volume :
130
Database :
Academic Search Index
Journal :
Engineering Applications of Artificial Intelligence
Publication Type :
Academic Journal
Accession number :
175936539
Full Text :
https://doi.org/10.1016/j.engappai.2023.107725