openai / atari-demoLinks
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
☆32Updated 6 years ago
Alternatives and similar repositories for atari-demo
Users that are interested in atari-demo are comparing it to the libraries listed below
Sorting:
- OpenAI Retro Contest☆65Updated 2 years ago
- Training Sonic with RLlib☆59Updated 2 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆203Updated 6 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆306Updated 2 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆74Updated 2 years ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆172Updated 2 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆116Updated 5 years ago
- Code for the paper "Understanding RL Vision"☆48Updated 2 years ago
- Code for the paper "Evolved Policy Gradients"☆250Updated 6 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆36Updated 2 years ago
- Vectorized interface for reinforcement learning environments☆140Updated 2 years ago
- ☆117Updated 5 years ago
- StarCraft: BroodWars OpenAI Gym environment☆83Updated 6 years ago
- ☆44Updated 6 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆125Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆134Updated 11 months ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 7 years ago
- Code for the paper "Quantifying Transfer in Reinforcement Learning"☆398Updated last year
- Publicly releasable baselines for the Retro contest☆127Updated 6 years ago
- Machine Learning Course Project Skoltech 2018☆108Updated 6 years ago
- Gym - Doom environments based on VizDoom.☆103Updated 8 years ago
- Highly Modular and Scalable Reinforcement Learning☆115Updated 5 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆44Updated 2 years ago
- ☆43Updated 8 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge☆84Updated 5 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆69Updated 7 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Updated 7 years ago