Generative Temporal Models with Spatial Memory for Partially Observed Environments

Marco Fraccaro*, Danilo Jimenez Rezende, Zwols Yori, Alexander Pritzel, S. M.Ali Eslami, Viola Fabio

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

33 Downloads (Pure)

Abstract

In model-based reinforcement learning, generative and temporal models of environments can be leveraged to boost agent performance, either by tuning the agent's representations during training or via use as part of an explicit planning mechanism. However, their application in practice has been limited to simplistic environments, due to the difficulty of training such models in larger, potentially partially-observed and 3D environments. In this work we introduce a novel action-conditioned generative model of such challenging environments. The model features a non-parametric spatial memory system in which we store learned, disentangled representations of the environment. Low-dimensional spatial updates are computed using a state-space model that makes use of knowledge on the prior dynamics of the moving agent, and high-dimensional visual observations are modelled with a Variational Auto-Encoder. The result is a scalable architecture capable of performing coherent predictions over hundreds of time steps across a range of partially observed 2D and 3D environments.

Original languageEnglish
Title of host publicationProceedings of 35th International Conference on Machine Learning
EditorsJennifer Dy, Andreas Krause
PublisherInternational Machine Learning Society (IMLS)
Publication date1 Jan 2018
Pages2518-2527
ISBN (Electronic)9781510867963
Publication statusPublished - 1 Jan 2018
Event35th International Conference on Machine Learning - Stockholm, Sweden
Duration: 10 Jul 201815 Jul 2018
Conference number: 35

Conference

Conference35th International Conference on Machine Learning
Number35
Country/TerritorySweden
CityStockholm
Period10/07/201815/07/2018
Series35th International Conference on Machine Learning, ICML 2018
Volume4

Fingerprint

Dive into the research topics of 'Generative Temporal Models with Spatial Memory for Partially Observed Environments'. Together they form a unique fingerprint.

Cite this