jbkjr / train-procgen-pytorchLinks
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆14Updated last year
Alternatives and similar repositories for train-procgen-pytorch
Users that are interested in train-procgen-pytorch are comparing it to the libraries listed below
Sorting:
- ☆19Updated 2 years ago
- Interpreting how transformers simulate agents performing RL tasks☆82Updated last year
- ☆28Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆32Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated last year
- ☆53Updated 7 months ago
- Code associated to papers on superposition (in ML interpretability)☆28Updated 2 years ago
- ☆101Updated last year
- flexible meta-learning in jax☆14Updated last year
- Baselines for gymnax 🤖☆66Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆73Updated 9 months ago
- ☆51Updated 2 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- ☆36Updated 2 years ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- PAIRED in PyTorch 🔥☆60Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Redwood Research's transformer interpretability tools☆15Updated 3 years ago
- General Modules for JAX☆66Updated last month
- ☆39Updated 3 years ago
- ☆31Updated 2 years ago
- ☆39Updated 10 months ago
- Fast and procedurally generated side-scroller-game-like graphical environments (formerly Procgen)☆28Updated last year
- ☆79Updated 2 months ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago