Back to Search
Start Over
Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction
- Source :
- ICIP
- Publication Year :
- 2020
-
Abstract
- We analyze the performance of feedforward vs. recurrent neural network (RNN) architectures and associated training methods for learned frame prediction. To this effect, we trained a residual fully convolutional neural network (FCNN), a convolutional RNN (CRNN), and a convolutional long short-term memory (CLSTM) network for next frame prediction using the mean square loss. We performed both stateless and stateful training for recurrent networks. Experimental results show that the residual FCNN architecture performs the best in terms of peak signal to noise ratio (PSNR) at the expense of higher training and test (inference) computational complexity. The CRNN can be trained stably and very efficiently using the stateful truncated backpropagation through time procedure, and it requires an order of magnitude less inference runtime to achieve near real-time frame prediction with an acceptable performance.<br />Accepted for publication at IEEE ICIP 2019
- Subjects :
- FOS: Computer and information sciences
Computer science
business.industry
Computer Vision and Pattern Recognition (cs.CV)
Image and Video Processing (eess.IV)
05 social sciences
Frame (networking)
Computer Science - Computer Vision and Pattern Recognition
Pattern recognition
010501 environmental sciences
Electrical Engineering and Systems Science - Image and Video Processing
01 natural sciences
Convolutional neural network
Memory management
Recurrent neural network
0502 economics and business
FOS: Electrical engineering, electronic engineering, information engineering
Backpropagation through time
Artificial intelligence
050207 economics
business
0105 earth and related environmental sciences
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- ICIP
- Accession number :
- edsair.doi.dedup.....ad5b6bc0ffd744ce0c4177f8914bba21