openai / atari-demo
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
☆32Updated 6 years ago
Alternatives and similar repositories for atari-demo
Users who are interested in atari-demo are comparing it to the libraries listed below
- OpenAI Retro Contest☆65Updated 2 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆74Updated 2 years ago
- Wikipedia navigation environment for OpenAI Gym☆40Updated 2 years ago
- Training Sonic with RLlib☆59Updated 2 years ago
- Code for the paper "Understanding RL Vision"☆48Updated 2 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆201Updated 6 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆35Updated 2 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆20Updated 8 years ago
- Code for the paper "World of Bits: An Open-Domain Platform for Web-Based Agents"☆30Updated 6 years ago
- Code for the paper "Batch size invariance for policy optimization"☆51Updated 2 years ago
- E2C implementation in PyTorch☆43Updated 7 years ago
- ☆43Updated 8 years ago
- ☆44Updated 6 years ago
- Reward Learning by Simulating the Past☆44Updated 6 years ago
- Code for 'The Grand Atari Challenge dataset' paper☆53Updated 7 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 7 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Implementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) using the advantages of Tensorflow 2.x.☆9Updated 5 years ago
- ☆117Updated 4 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆44Updated 2 years ago
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆134Updated 10 months ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Updated 7 years ago
- Proximal Policy Optimization with Stein Control Variates☆33Updated 7 years ago
- ☆19Updated 9 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13Updated 4 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Some common TD Learning algorithms☆66Updated 5 years ago
- Official implementation of ICML paper Imitating Latent Policies from Observation☆75Updated 6 years ago