openai / understanding-rl-visionLinks
Code for the paper "Understanding RL Vision"
☆50Updated 2 years ago
Alternatives and similar repositories for understanding-rl-vision
Users that are interested in understanding-rl-vision are comparing it to the libraries listed below
Sorting:
- Code for the paper "Batch size invariance for policy optimization"☆56Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆31Updated 5 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents☆109Updated last year
- A tool for recording RL trajectories.☆111Updated 6 months ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Updated 2 years ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆179Updated 2 years ago
- ☆46Updated last year
- Web application where humans can play Overcooked with AI agents.☆60Updated 3 years ago
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles☆90Updated last month
- Minimal code for A Generalist Agent☆44Updated 3 years ago
- A networking protocol for agent-environment communication☆108Updated last year
- Code for the paper "Phasic Policy Gradient"☆267Updated 2 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆100Updated 2 years ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆51Updated last year
- Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration☆33Updated 5 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆43Updated 11 months ago
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆64Updated 10 months ago
- Repo to reproduce the First-Explore paper results☆39Updated last year
- ☆32Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆138Updated last year
- Explore and Control with Adversarial Surprise☆10Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- ☆42Updated 3 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆42Updated 3 years ago
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆86Updated 3 years ago
- The source code for the gym-microrts paper.☆42Updated 3 years ago
- ☆15Updated last year
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆85Updated 2 years ago