jbkjr / train-procgen-pytorch
PyTorch implementation of OpenAI's Procgen PPO baseline, built from scratch.
☆14 Updated last year
Alternatives and similar repositories for train-procgen-pytorch
Users who are interested in train-procgen-pytorch are comparing it to the libraries listed below.
- ☆19 Updated 2 years ago
- ☆53 Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆33 Updated 3 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions ☆72 Updated last year
- Interpreting how transformers simulate agents performing RL tasks ☆90 Updated 2 years ago
- ☆23 Updated last year
- Language-annotated Abstraction and Reasoning Corpus ☆98 Updated 2 years ago
- Scaling scaling laws with board games. ☆54 Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆47 Updated 4 years ago
- An implementation of MuZero in JAX. ☆58 Updated 3 years ago
- ☆39 Updated last year
- A dataset of alignment research and code to reproduce it ☆78 Updated 2 years ago
- ☆37 Updated 2 years ago
- A framework for experimenting with never-ending learning ☆81 Updated last year
- ☆57 Updated last year
- A collection of meta-learning algorithms in Jax ☆23 Updated 3 years ago
- ☆75 Updated last year
- 🎢 Creating and sharing simulation environments for embodied and synthetic data research ☆192 Updated 2 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165) ☆22 Updated 3 years ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019) ☆37 Updated 2 years ago
- Reinforcement learning library in JAX. ☆100 Updated 2 years ago
- ☆41 Updated 3 years ago
- ☆28 Updated 3 years ago
- The Abstraction and Reasoning Corpus made into a web game ☆90 Updated last year
- AlphaZero in JAX ☆81 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆120 Updated last year
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge ☆60 Updated 3 years ago
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles ☆88 Updated 3 weeks ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆106 Updated 3 years ago
- Efficient baselines for autocurricula in JAX. ☆206 Updated last year