jbkjr / train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆14Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for train-procgen-pytorch
- ☆18Updated last year
- Redwood Research's transformer interpretability tools☆12Updated 2 years ago
- ☆35Updated last year
- An Open-Ended Agentic Simulator☆28Updated 3 months ago
- Interpreting how transformers simulate agents performing RL tasks☆73Updated last year
- ☆28Updated 2 years ago
- Explainable Reinforcement Learning (XRL) Resources☆33Updated last month
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆31Updated 2 years ago
- ☆21Updated 7 months ago
- Accelerated replay buffers in JAX☆40Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 6 months ago
- ☆48Updated last year
- ☆17Updated last year
- flexible meta-learning in jax☆12Updated last year
- Measuring the situational awareness of language models☆33Updated 9 months ago
- ☆63Updated 3 months ago
- An implementation of MuZero in JAX.☆53Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆63Updated 3 months ago
- Repo to reproduce the First-Explore paper results☆36Updated 3 weeks ago
- Baselines for gymnax 🤖☆60Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆64Updated 2 years ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code☆27Updated this week
- PAIRED in PyTorch 🔥☆56Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- ☆36Updated 2 years ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆126Updated 3 months ago
- Levin tree search guided by both a policy and a heuristic function☆14Updated last year
- Standard interface for entity based reinforcement learning environments.☆36Updated 8 months ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆63Updated 2 months ago
- ☆29Updated 2 years ago