Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Nov 15, 2018Updated 7 years ago
Alternatives and similar repositories for model_ensemble_meta_learning
Users that are interested in model_ensemble_meta_learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Apr 28, 2019Updated 6 years ago
- NeurIPS Reproducibility Challenge 2019☆21Feb 25, 2020Updated 6 years ago
- ☆99Mar 24, 2023Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in TensorFlow 2.☆27May 11, 2021Updated 4 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆42Aug 27, 2022Updated 3 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆248Sep 30, 2022Updated 3 years ago
- The reimplementation of Model Predictive Path Integral (MPPI) from the paper "Information Theoretic MPC for Model-Based Reinforcement Lea…☆104Jul 11, 2025Updated 8 months ago
- ☆399Jul 18, 2019Updated 6 years ago
- Thesis: Application of Reinforcement Learning for the Control of Nonlinear Dynamical Systems☆18Apr 16, 2020Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆537Nov 22, 2022Updated 3 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Apr 14, 2021Updated 4 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13May 5, 2021Updated 4 years ago
- Evolving Neural Network through the Reverse Encoding Tree☆15Jun 2, 2021Updated 4 years ago
- A collection of simple worlds☆12Nov 26, 2019Updated 6 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- ☆16Jun 30, 2019Updated 6 years ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.☆12Jun 20, 2017Updated 8 years ago
- I2Q: A Fully Decentralized Q-Learning Algorithm☆19Nov 10, 2022Updated 3 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆505Dec 1, 2022Updated 3 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- Meta-learning Gaussian process (GP) priors via PAC-Bayes bounds☆26Jan 25, 2024Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆33Dec 14, 2023Updated 2 years ago
- Unofficial Implementation of the paper "Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control", applied to gy…☆148May 15, 2023Updated 2 years ago
- ☆20Nov 13, 2023Updated 2 years ago
- [deprecated] Engine Agnostic Gym Environment for Robotics☆17Feb 10, 2022Updated 4 years ago
- Bayesian Reinforcement Learning in Tensorflow☆336Feb 15, 2021Updated 5 years ago
- code to reproduce the empirical results in the research paper☆38Oct 12, 2021Updated 4 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆472Jul 6, 2023Updated 2 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Oct 26, 2020Updated 5 years ago
- ☆92Dec 5, 2023Updated 2 years ago
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Jan 9, 2018Updated 8 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago