Implementation of Model-Agnostic Meta-Learning (MAML) applied to Reinforcement Learning problems in TensorFlow 2.
☆27 · May 11, 2021 · Updated 4 years ago
Alternatives and similar repositories for maml-rl-tf2
Users interested in maml-rl-tf2 are comparing it to the libraries listed below.
- Taming MAML: efficient unbiased meta-reinforcement learning ☆30 · Sep 30, 2022 · Updated 3 years ago
- TensorFlow implementation of SNAIL and RL2 ☆11 · Aug 17, 2019 · Updated 6 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models ☆10 · Mar 22, 2018 · Updated 7 years ago
- Simple, extensible implementations of some meta-learning algorithms in JAX ☆11 · Oct 6, 2020 · Updated 5 years ago
- PyTorch implementation of Episodic Meta Reinforcement Learning on variants of the "Two-Step" task. Reproduces the results found in three … ☆37 · Dec 12, 2020 · Updated 5 years ago
- PyTorch implementation of two variants of the Harlow visual fixation task (PsychLab and 1D version). Reproduces the results found in two … ☆14 · Sep 2, 2020 · Updated 5 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me… ☆248 · Sep 30, 2022 · Updated 3 years ago
- ☆15 · Sep 14, 2020 · Updated 5 years ago
- A TensorFlow 2.0 (eager execution) implementation of the PyTorch OpenAI few-shot regression toy example ☆16 · Jun 24, 2019 · Updated 6 years ago
- Exploration-based Reinforcement Learning (Montezuma's Revenge) ☆14 · Jul 23, 2018 · Updated 7 years ago
- E-MAML and RL-MAML baseline implemented in TensorFlow v1 ☆17 · Dec 7, 2019 · Updated 6 years ago
- Meta Reinforcement Learning Experiments ☆35 · Aug 22, 2017 · Updated 8 years ago
- Clockwork VAEs in JAX/Flax ☆32 · Jul 16, 2021 · Updated 4 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 · Nov 15, 2018 · Updated 7 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX. ☆21 · Sep 18, 2020 · Updated 5 years ago
- Reinforcement Learning with Model-Agnostic Meta-Learning in PyTorch ☆876 · Dec 27, 2022 · Updated 3 years ago
- PyTorch implementation of Probabilistic Network Ensembles on toy problems ☆23 · Feb 1, 2023 · Updated 3 years ago
- A well-documented A2C written in PyTorch ☆52 · Jun 3, 2019 · Updated 6 years ago
- Implementation of Relational Deep Reinforcement Learning ☆25 · Jan 31, 2020 · Updated 6 years ago
- Experiments in training a transformer network to master reinforcement learning environments. ☆32 · Mar 14, 2021 · Updated 4 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight) ☆38 · Mar 1, 2021 · Updated 5 years ago
- A curated list of awesome Meta Reinforcement Learning ☆33 · May 7, 2020 · Updated 5 years ago
- Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks" ☆664 · Jan 19, 2023 · Updated 3 years ago
- ☆80 · Dec 9, 2022 · Updated 3 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021) ☆33 · Oct 6, 2022 · Updated 3 years ago
- Fleet Management Simulation Framework ☆33 · Feb 24, 2019 · Updated 7 years ago
- Introduction to Gaussian Processes ☆11 · Jan 13, 2024 · Updated 2 years ago
- PyTorch implementation of Distribution Correction (DisCor) based on Soft Actor-Critic. ☆38 · Jun 22, 2022 · Updated 3 years ago
- Code for "Fast Context Adaptation via Meta-Learning" ☆146 · Mar 22, 2021 · Updated 4 years ago
- Reinforcement Learning using the Actor-Critic framework for the L2RPN challenge (https://l2rpn.chalearn.org/ & https://competitions.codal… ☆39 · Jul 15, 2019 · Updated 6 years ago
- A simple multicohort LTV calculator for subscriptions ☆11 · Mar 7, 2023 · Updated 2 years ago
- Code for the paper "Effective Multi-agent Reinforcement Learning Control with Relative Entropy Regularization". ☆13 · Sep 27, 2023 · Updated 2 years ago
- MATLAB implementation of the universal directed information estimators in Jiantao Jiao, Haim H. Permuter, Lei Zhao, Young-Han Kim, and Ts… ☆11 · Apr 2, 2019 · Updated 6 years ago
- yet another reinforcement learning package ☆12 · May 24, 2022 · Updated 3 years ago
- ☆10 · Aug 13, 2022 · Updated 3 years ago
- A pathway and collection of resources for learning JAX, from beginner to advanced. ☆11 · Jan 2, 2021 · Updated 5 years ago
- ☆11 · Jan 10, 2020 · Updated 6 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits ☆10 · Oct 21, 2024 · Updated last year
- JAX implementation of VIT-VQGAN ☆10 · Jan 25, 2024 · Updated 2 years ago