openai / understanding-rl-visionLinks
Code for the paper "Understanding RL Vision"
☆50Updated 2 years ago
Alternatives and similar repositories for understanding-rl-vision
Users that are interested in understanding-rl-vision are comparing it to the libraries listed below
Sorting:
- Code for the paper "Batch size invariance for policy optimization"☆53Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last month
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆75Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆116Updated last year
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆177Updated 2 years ago
- A tool for recording RL trajectories.☆108Updated 3 months ago
- A networking protocol for agent-environment communication☆105Updated 8 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- Repo to reproduce the First-Explore paper results☆38Updated 10 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆104Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆135Updated last year
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆51Updated last year
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Updated 4 years ago
- Fast and procedurally generated side-scroller-game-like graphical environments (formerly Procgen)☆29Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆38Updated 7 months ago
- ☆31Updated last year
- Code for the paper "Phasic Policy Gradient"☆266Updated 2 years ago
- Vectorized interface for reinforcement learning environments☆143Updated 2 years ago
- ☆46Updated last year
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆43Updated 3 years ago
- An implementation of MuZero in JAX.☆57Updated 2 years ago
- Web application where humans can play Overcooked with AI agents.☆59Updated 2 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆97Updated 2 years ago
- Efficient baselines for autocurricula in JAX.☆197Updated last year
- ☆42Updated 3 years ago
- Corax: Core RL in JAX☆38Updated last year
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆87Updated last year
- Deep Hierarchical Planning from Pixels☆109Updated 2 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆142Updated 2 years ago
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles☆84Updated last year