openai / understanding-rl-vision
Code for the paper "Understanding RL Vision"
☆46Updated last year
Alternatives and similar repositories for understanding-rl-vision:
Users that are interested in understanding-rl-vision are comparing it to the libraries listed below
- Code for the paper "Batch size invariance for policy optimization"☆48Updated last year
- ☆42Updated 2 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆32Updated 6 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- ☆14Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 7 months ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆171Updated last year
- A networking protocol for agent-environment communication☆99Updated last month
- ExORL: Exploratory Data for Offline Reinforcement Learning☆111Updated 3 years ago
- Web application where humans can play Overcooked with AI agents.☆58Updated 2 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆69Updated last year
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆44Updated last year
- ☆28Updated 2 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆60Updated last year
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆141Updated last year
- ☆31Updated last year
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 10 months ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆85Updated last year
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 4 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆82Updated 2 years ago
- ☆43Updated 6 months ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- A tool for recording RL trajectories.☆100Updated 4 months ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Code for Automatic Curriculum Learning through Value Disagreement☆30Updated 4 years ago