jbkjr / train-procgen-pytorchLinks
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆14Updated last year
Alternatives and similar repositories for train-procgen-pytorch
Users that are interested in train-procgen-pytorch are comparing it to the libraries listed below
Sorting:
- ☆19Updated 3 years ago
- Interpreting how transformers simulate agents performing RL tasks☆90Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 4 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆47Updated 4 years ago
- ☆53Updated 2 years ago
- Scaling scaling laws with board games.☆53Updated 2 years ago
- ☆39Updated last year
- ☆23Updated last year
- Redwood Research's transformer interpretability tools☆15Updated 3 years ago
- Language-annotated Abstraction and Reasoning Corpus☆99Updated 2 years ago
- A collection of meta-learning algorithms in Jax☆24Updated 3 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆61Updated 3 years ago
- An environment for learning formal mathematical reasoning from scratch☆72Updated last year
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆73Updated last year
- ☆41Updated 3 years ago
- AlphaZero in JAX☆81Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆135Updated 2 years ago
- The Energy Transformer block, in JAX☆63Updated 2 years ago
- Efficient baselines for autocurricula in JAX.☆206Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆83Updated 3 years ago
- ☆31Updated 3 years ago
- ☆37Updated 2 years ago
- ☆28Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- A dataset of alignment research and code to reproduce it☆78Updated 2 years ago
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- ☆101Updated last year
- Learning Universal Predictors☆81Updated last year
- An environment for benchmarking commonsense agents☆29Updated 5 years ago
- ☆57Updated last year