Improving Generative Imagination in Object-Centric World Models

Zhixuan Lin; Yi-Fu Wu; Skand Peri; Bofeng Fu; Jindong Jiang; Sungjin Ahn

Improving Generative Imagination in Object-Centric World Models

Zhixuan Lin, Yi-Fu Wu, Skand Peri, Bofeng Fu, Jindong Jiang, Sungjin Ahn

Proceedings of the 37th International Conference on Machine Learning, PMLR 119:6140-6149, 2020.

Abstract

The remarkable recent advances in object-centric generative world models raise a few questions. First, while many of the recent achievements are indispensable for making a general and versatile world model, it is quite unclear how these ingredients can be integrated into a unified framework. Second, despite using generative objectives, abilities for object detection and tracking are mainly investigated, leaving the crucial ability of temporal imagination largely under question. Third, a few key abilities for more faithful temporal imagination such as multimodal uncertainty and situation-awareness are missing. In this paper, we introduce Generative Structured World Models (G-SWM). The G-SWM achieves the versatile world modeling not only by unifying the key properties of previous models in a principled framework but also by achieving two crucial new abilities, multimodal uncertainty and situation-awareness. Our thorough investigation on the temporal generation ability in comparison to the previous models demonstrates that G-SWM achieves the versatility with the best or comparable performance for all experiment settings including a few complex settings that have not been tested before. https://sites.google.com/view/gswm

Cite this Paper

BibTeX

@InProceedings{pmlr-v119-lin20f,
  title = 	 {Improving Generative Imagination in Object-Centric World Models},
  author =       {Lin, Zhixuan and Wu, Yi-Fu and Peri, Skand and Fu, Bofeng and Jiang, Jindong and Ahn, Sungjin},
  booktitle = 	 {Proceedings of the 37th International Conference on Machine Learning},
  pages = 	 {6140--6149},
  year = 	 {2020},
  editor = 	 {III, Hal Daumé and Singh, Aarti},
  volume = 	 {119},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {13--18 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v119/lin20f/lin20f.pdf},
  url = 	 {https://proceedings.mlr.press/v119/lin20f.html},
  abstract = 	 {The remarkable recent advances in object-centric generative world models raise a few questions. First, while many of the recent achievements are indispensable for making a general and versatile world model, it is quite unclear how these ingredients can be integrated into a unified framework. Second, despite using generative objectives, abilities for object detection and tracking are mainly investigated, leaving the crucial ability of temporal imagination largely under question. Third, a few key abilities for more faithful temporal imagination such as multimodal uncertainty and situation-awareness are missing. In this paper, we introduce Generative Structured World Models (G-SWM). The G-SWM achieves the versatile world modeling not only by unifying the key properties of previous models in a principled framework but also by achieving two crucial new abilities, multimodal uncertainty and situation-awareness. Our thorough investigation on the temporal generation ability in comparison to the previous models demonstrates that G-SWM achieves the versatility with the best or comparable performance for all experiment settings including a few complex settings that have not been tested before. https://sites.google.com/view/gswm}
}

Endnote

%0 Conference Paper
%T Improving Generative Imagination in Object-Centric World Models
%A Zhixuan Lin
%A Yi-Fu Wu
%A Skand Peri
%A Bofeng Fu
%A Jindong Jiang
%A Sungjin Ahn
%B Proceedings of the 37th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2020
%E Hal Daumé III
%E Aarti Singh	
%F pmlr-v119-lin20f
%I PMLR
%P 6140--6149
%U https://proceedings.mlr.press/v119/lin20f.html
%V 119
%X The remarkable recent advances in object-centric generative world models raise a few questions. First, while many of the recent achievements are indispensable for making a general and versatile world model, it is quite unclear how these ingredients can be integrated into a unified framework. Second, despite using generative objectives, abilities for object detection and tracking are mainly investigated, leaving the crucial ability of temporal imagination largely under question. Third, a few key abilities for more faithful temporal imagination such as multimodal uncertainty and situation-awareness are missing. In this paper, we introduce Generative Structured World Models (G-SWM). The G-SWM achieves the versatile world modeling not only by unifying the key properties of previous models in a principled framework but also by achieving two crucial new abilities, multimodal uncertainty and situation-awareness. Our thorough investigation on the temporal generation ability in comparison to the previous models demonstrates that G-SWM achieves the versatility with the best or comparable performance for all experiment settings including a few complex settings that have not been tested before. https://sites.google.com/view/gswm

APA

Lin, Z., Wu, Y., Peri, S., Fu, B., Jiang, J. & Ahn, S.. (2020). Improving Generative Imagination in Object-Centric World Models. Proceedings of the 37th International Conference on Machine Learning, in Proceedings of Machine Learning Research 119:6140-6149 Available from https://proceedings.mlr.press/v119/lin20f.html.

Improving Generative Imagination in Object-Centric World Models

Abstract

Cite this Paper

Related Material