Stochastic Latent Residual Video Prediction

Jean-Yves Franceschi; Edouard Delasalles; Mickael Chen; Sylvain Lamprier; Patrick Gallinari

Stochastic Latent Residual Video Prediction

Jean-Yves Franceschi, Edouard Delasalles, Mickael Chen, Sylvain Lamprier, Patrick Gallinari

Proceedings of the 37th International Conference on Machine Learning, PMLR 119:3233-3246, 2020.

Abstract

Designing video prediction models that account for the inherent uncertainty of the future is challenging. Most works in the literature are based on stochastic image-autoregressive recurrent networks, which raises several performance and applicability issues. An alternative is to use fully latent temporal models which untie frame synthesis and temporal dynamics. However, no such model for stochastic video prediction has been proposed in the literature yet, due to design and training difficulties. In this paper, we overcome these difficulties by introducing a novel stochastic temporal model whose dynamics are governed in a latent space by a residual update rule. This first-order scheme is motivated by discretization schemes of differential equations. It naturally models video dynamics as it allows our simpler, more interpretable, latent model to outperform prior state-of-the-art methods on challenging datasets.

Cite this Paper

BibTeX

@InProceedings{pmlr-v119-franceschi20a,
  title = 	 {Stochastic Latent Residual Video Prediction},
  author =       {Franceschi, Jean-Yves and Delasalles, Edouard and Chen, Mickael and Lamprier, Sylvain and Gallinari, Patrick},
  booktitle = 	 {Proceedings of the 37th International Conference on Machine Learning},
  pages = 	 {3233--3246},
  year = 	 {2020},
  editor = 	 {III, Hal Daumé and Singh, Aarti},
  volume = 	 {119},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {13--18 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v119/franceschi20a/franceschi20a.pdf},
  url = 	 {https://proceedings.mlr.press/v119/franceschi20a.html},
  abstract = 	 {Designing video prediction models that account for the inherent uncertainty of the future is challenging. Most works in the literature are based on stochastic image-autoregressive recurrent networks, which raises several performance and applicability issues. An alternative is to use fully latent temporal models which untie frame synthesis and temporal dynamics. However, no such model for stochastic video prediction has been proposed in the literature yet, due to design and training difficulties. In this paper, we overcome these difficulties by introducing a novel stochastic temporal model whose dynamics are governed in a latent space by a residual update rule. This first-order scheme is motivated by discretization schemes of differential equations. It naturally models video dynamics as it allows our simpler, more interpretable, latent model to outperform prior state-of-the-art methods on challenging datasets.}
}

Endnote

%0 Conference Paper
%T Stochastic Latent Residual Video Prediction
%A Jean-Yves Franceschi
%A Edouard Delasalles
%A Mickael Chen
%A Sylvain Lamprier
%A Patrick Gallinari
%B Proceedings of the 37th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2020
%E Hal Daumé III
%E Aarti Singh	
%F pmlr-v119-franceschi20a
%I PMLR
%P 3233--3246
%U https://proceedings.mlr.press/v119/franceschi20a.html
%V 119
%X Designing video prediction models that account for the inherent uncertainty of the future is challenging. Most works in the literature are based on stochastic image-autoregressive recurrent networks, which raises several performance and applicability issues. An alternative is to use fully latent temporal models which untie frame synthesis and temporal dynamics. However, no such model for stochastic video prediction has been proposed in the literature yet, due to design and training difficulties. In this paper, we overcome these difficulties by introducing a novel stochastic temporal model whose dynamics are governed in a latent space by a residual update rule. This first-order scheme is motivated by discretization schemes of differential equations. It naturally models video dynamics as it allows our simpler, more interpretable, latent model to outperform prior state-of-the-art methods on challenging datasets.

APA

Franceschi, J., Delasalles, E., Chen, M., Lamprier, S. & Gallinari, P.. (2020). Stochastic Latent Residual Video Prediction. Proceedings of the 37th International Conference on Machine Learning, in Proceedings of Machine Learning Research 119:3233-3246 Available from https://proceedings.mlr.press/v119/franceschi20a.html.

Stochastic Latent Residual Video Prediction

Abstract

Cite this Paper

Related Material