jbkjr / train-procgen-pytorch
PyTorch implementation of OpenAI's Procgen PPO baseline, built from scratch.
☆14 Updated 11 months ago
Alternatives and similar repositories for train-procgen-pytorch:
Users who are interested in train-procgen-pytorch are comparing it to the libraries listed below.
- ☆19 Updated 2 years ago
- Interpreting how transformers simulate agents performing RL tasks ☆80 Updated last year
- ☆23 Updated last year
- ☆36 Updated 2 years ago
- ☆28 Updated 2 years ago
- Baselines for gymnax 🤖 ☆66 Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆72 Updated 8 months ago
- ☆51 Updated 2 years ago
- General Modules for JAX ☆64 Updated last month
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆54 Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function ☆13 Updated 2 years ago
- An implementation of MuZero in JAX. ☆56 Updated 2 years ago
- Accelerated replay buffers in JAX ☆41 Updated 2 years ago
- ☆101 Updated last year
- ☆77 Updated last month
- Scaling scaling laws with board games. ☆48 Updated last year
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆31 Updated 3 years ago
- Levin tree search guided by both a policy and a heuristic function ☆18 Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆104 Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆161 Updated last month
- Drop-in environment replacements that make your RL algorithm train faster. ☆20 Updated 10 months ago
- ☆20 Updated 2 years ago
- PAIRED in PyTorch 🔥 ☆59 Updated 2 years ago
- Vectorization techniques for fast population-based training. ☆56 Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 3 years ago
- Standard interface for entity based reinforcement learning environments. ☆37 Updated last year
- ☆31 Updated 2 years ago
- ☆31 Updated last year
- An Open-Ended Agentic Simulator ☆48 Updated 8 months ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ☆130 Updated 8 months ago