smonsays / metaxLinks
flexible meta-learning in jax
☆16Updated 2 years ago
Alternatives and similar repositories for metax
Users that are interested in metax are comparing it to the libraries listed below
Sorting:
- Accelerated replay buffers in JAX☆46Updated 3 years ago
- A collection of meta-learning algorithms in Jax☆23Updated 3 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- An implementation of MuZero in JAX.☆58Updated 3 years ago
- JAX implementation of the Mistral 7b v0.1 model☆13Updated last year
- General Modules for JAX☆72Updated 3 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆32Updated 2 years ago
- ☆19Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Updated 4 years ago
- Vectorization techniques for fast population-based training.☆56Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- JAX implementations of core Deep RL algorithms☆82Updated 3 years ago
- Building blocks for productive research☆66Updated 5 months ago
- Baselines for gymnax 🤖☆74Updated 2 years ago
- Dreamer on JAX☆16Updated 3 years ago
- Flax Implementation of DreamerV3 on Crafter☆18Updated last month
- Learning Robust Dynamics Through Variational Sparse Gating☆20Updated 3 years ago
- ☆28Updated 3 years ago
- ☆46Updated last year
- ☆35Updated last year
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 8 months ago
- ☆58Updated 3 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Updated 2 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆72Updated last year
- PyTorch Package For Quasimetric Learning☆44Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆120Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆22Updated last year
- Generalised UDRL☆37Updated 3 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆112Updated 2 years ago