gcucurull / maml_flaxLinks
Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.
☆19Updated 4 years ago
Alternatives and similar repositories for maml_flax
Users that are interested in maml_flax are comparing it to the libraries listed below
Sorting:
- Simple, extensible implementations of some meta-learning algorithms in Jax☆10Updated 4 years ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Variational Reinforcement Learning☆16Updated 10 months ago
- Generalised UDRL☆37Updated 3 years ago
- flexible meta-learning in jax☆14Updated last year
- Clockwork VAEs in JAX/Flax☆32Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)☆26Updated 3 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Updated 2 years ago
- PyTorch Package For Quasimetric Learning☆42Updated 7 months ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated last year
- ☆23Updated 3 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- ☆13Updated 10 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆16Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 7 months ago
- ☆17Updated last year
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- Simplistic Pytorch Implementation of the Dreamer-RL☆21Updated 3 weeks ago
- ☆56Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- ☆17Updated 3 years ago
- ☆18Updated 2 years ago