jbkjr / train-procgen-pytorch
PyTorch implementation of OpenAI's Procgen PPO baseline, built from scratch.
☆14 Updated last year
Alternatives and similar repositories for train-procgen-pytorch
Users who are interested in train-procgen-pytorch are comparing it to the libraries listed below.
- ☆19 Updated 2 years ago
- ☆52 Updated 2 years ago
- Interpreting how transformers simulate agents performing RL tasks ☆88 Updated last year
- Scaling scaling laws with board games. ☆53 Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆33 Updated 3 years ago
- Baselines for gymnax 🤖 ☆72 Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆58 Updated 3 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆46 Updated 4 years ago
- An implementation of MuZero in JAX. ☆57 Updated 2 years ago
- A collection of meta-learning algorithms in Jax ☆23 Updated 3 years ago
- Redwood Research's transformer interpretability tools ☆14 Updated 3 years ago
- A programming language for formal/informal computation. ☆41 Updated 2 months ago
- 🎢 Creating and sharing simulation environments for embodied and synthetic data research ☆190 Updated last year
- AlphaZero in JAX ☆78 Updated last year
- Efficient baselines for autocurricula in JAX. ☆196 Updated last year
- ☆41 Updated 3 years ago
- A networking protocol for agent-environment communication ☆104 Updated 8 months ago
- Code for the paper "Understanding RL Vision" ☆50 Updated 2 years ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆132 Updated last year
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165) ☆22 Updated 3 years ago
- ☆23 Updated last year
- Neuro-evolution for OpenAI Gym environments ☆57 Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆115 Updated last year
- Reinforcement learning library in JAX. ☆100 Updated last year
- ☆84 Updated last month
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions ☆68 Updated last year
- ☆31 Updated 3 years ago
- megastep helps you build 1-million FPS reinforcement learning environments on a single GPU ☆142 Updated 3 years ago
- Accelerated replay buffers in JAX ☆43 Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆22 Updated 4 years ago