Deep Planning Network (PlaNet), a model-based agent that learns the environment dynamics from pixels and chooses actions through online planning in a compact latent space. To learn the dynamics, we use a transition model with both stochastic and deterministic components and train it using a generalized variational objective that encourages multi-step predictions. PlaNet solves continuous control tasks from pixels that are more difficult than those previously solved by planning with learned models.
The open source code is available here --> https://github.com/google-research/planet <--
This repo contains the source for the article.
draft.md - main text of the article, in markdown.
draft_appendix.md - appendix, in markdown.
draft_bib.html - the citations.
draft_header.html - start of the document
index.html - generated, don't edit this file.
git clone https://github.com/planetrl/planetrl.github.io.git
cd planetrl.github.io
npm install
chmod +x ./bin/*Modify text by editing draft.md -- this is where all of the content exists.
Appendix content goes in draft_appendix.md. Add bib entries to draft_bib.html.
Run ./bin/make to build document into index.html (which are identical).
Run python -m http.server to serve on the base directory to view draft.html in a local browser for debugging.
To watch all markdown files for changes and then compile them, you can run the following
brew install fswatch
./bin/watch