Back to Search Start Over

Structure-Constrained Motion Sequence Generation.

Authors :
Yan, Yichao
Ni, Bingbing
Zhang, Wendong
Xu, Jingwei
Yang, Xiaokang
Source :
IEEE Transactions on Multimedia; Jul2019, Vol. 21 Issue 7, p1799-1812, 14p
Publication Year :
2019

Abstract

Video generation is a challenging task due to the extremely high-dimensional distribution of the solution space. Good constraints in the solution domain would thus reduce the difficulty of approximating optimal solutions. In this paper, instead of directly generating high-dimensional video data, we propose using object landmarks as explicit structure constraints to address this issue. Specifically, we propose a two-stage framework for an action-conditioned video generation task. In our framework, the first stage aims to generate landmark sequences according to predefined motion types, and a recurrent model (RNN/LSTM) is adopted for this purpose. The landmark sequence can be regarded as a low-dimensional structure embedding of high-dimensional video data, and generating landmark sequences is much easier than generating videos. The second stage is inspired by a conditional generative adversarial network (CGAN), and we take the generated landmark sequence as a structure condition to learn a landmark-to-image translation network. Such a one-to-one translation framework avoids the difficulty of generating videos and instead transfers the video generation task to image generation, which is resolvable due to the maturity of current GAN-based models. The experimental results demonstrate that our model not only achieves promising results on rigid/nonrigid motion generation tasks but also can be extended to multiobject motion situations. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15209210
Volume :
21
Issue :
7
Database :
Complementary Index
Journal :
IEEE Transactions on Multimedia
Publication Type :
Academic Journal
Accession number :
137214019
Full Text :
https://doi.org/10.1109/TMM.2018.2885235