1. FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis
- Author
-
Kuangxiao Gu, Yuqian Zhou, and Thomas S. Huang
- Subjects
FOS: Computer and information sciences ,Landmark ,Computer science ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,02 engineering and technology ,General Medicine ,Face space ,0202 electrical engineering, electronic engineering, information engineering ,Learning network ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,Image warping ,business ,Computer facial animation - Abstract
Talking face synthesis has been widely studied in either appearance-based or warping-based methods. Previous works mostly utilize single face image as a source, and generate novel facial animations by merging other person's facial features. However, some facial regions like eyes or teeth, which may be hidden in the source image, can not be synthesized faithfully and stably. In this paper, We present a landmark driven two-stream network to generate faithful talking facial animation, in which more facial details are created, preserved and transferred from multiple source images instead of a single one. Specifically, we propose a network consisting of a learning and fetching stream. The fetching sub-net directly learns to attentively warp and merge facial regions from five source images of distinctive landmarks, while the learning pipeline renders facial organs from the training face space to compensate. Compared to baseline algorithms, extensive experiments demonstrate that the proposed method achieves a higher performance both quantitatively and qualitatively. Codes are at https://github.com/kgu3/FLNet_AAAI2020., Accepted by AAAI 2020
- Published
- 2020
- Full Text
- View/download PDF