Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44 · Nov 15, 2018 · Updated 7 years ago
Alternatives and similar repositories for model_ensemble_meta_learning
Users interested in model_ensemble_meta_learning are comparing it to the libraries listed below.
- Self-implemented code for Model-Based Meta-Reinforcement Learning ☆17 · Apr 28, 2019 · Updated 6 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- ☆99 · Mar 24, 2023 · Updated 2 years ago
- ☆22 · Nov 8, 2021 · Updated 4 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆41 · Aug 27, 2022 · Updated 3 years ago
- NeurIPS Reproducibility Challenge 2019 ☆20 · Feb 25, 2020 · Updated 6 years ago
- JAX implementation of ViT-VQGAN ☆10 · Jan 25, 2024 · Updated 2 years ago
- Attentional mechanism incorporated in Asynchronous Advantage Actor-Critic (A3C/A2C, DeepMind) ☆10 · Jan 9, 2018 · Updated 8 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Jun 24, 2020 · Updated 5 years ago
- Guided-Meta Policy Search ☆39 · Jan 19, 2023 · Updated 3 years ago
- Deep RL agents with PyTorch ☆36 · Sep 25, 2021 · Updated 4 years ago
- The reimplementation of Model Predictive Path Integral (MPPI) from the paper "Information Theoretic MPC for Model-Based Reinforcement Lea… ☆104 · Jul 11, 2025 · Updated 7 months ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithms. Includes a useful experiment framework for Me… ☆248 · Sep 30, 2022 · Updated 3 years ago
- MuJoCo models for Unitree Robots ☆12 · Nov 24, 2021 · Updated 4 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer ☆11 · Dec 31, 2016 · Updated 9 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms) ☆10 · Jun 28, 2019 · Updated 6 years ago
- Python implementation of tabular asynchronous actor critic ☆11 · May 3, 2016 · Updated 9 years ago
- PyTorch implementation of Memory Augmented Self-Play ☆52 · Oct 26, 2020 · Updated 5 years ago
- A library of probabilistic model-based RL algorithms in PyTorch ☆107 · Apr 14, 2021 · Updated 4 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model ☆154 · Oct 26, 2020 · Updated 5 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) applied to Reinforcement Learning problems in TensorFlow 2. ☆27 · May 11, 2021 · Updated 4 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆532 · Nov 22, 2022 · Updated 3 years ago
- Blog post: how to do deterministic policy gradient with Gumbel-softmax and why you should do it. ☆12 · Jun 20, 2017 · Updated 8 years ago
- ☆14 · Jun 21, 2016 · Updated 9 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆33 · Dec 14, 2023 · Updated 2 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO ☆31 · Jun 24, 2018 · Updated 7 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX ☆14 · Dec 24, 2021 · Updated 4 years ago
- Evolving Neural Network through the Reverse Encoding Tree ☆15 · Jun 2, 2021 · Updated 4 years ago
- (CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning ☆14 · Dec 27, 2022 · Updated 3 years ago
- ROS package for robot learning ☆17 · Oct 16, 2019 · Updated 6 years ago
- The implementation of Discriminator Soft Actor Critic ☆15 · Jan 25, 2020 · Updated 6 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning ☆21 · Mar 6, 2017 · Updated 8 years ago
- Reinforcement Learning Assignment: Easy21 ☆12 · Jul 4, 2016 · Updated 9 years ago
- [deprecated] Engine Agnostic Gym Environment for Robotics ☆17 · Feb 10, 2022 · Updated 4 years ago
- ☆36 · Oct 19, 2021 · Updated 4 years ago
- Code to reproduce the empirical results in the research paper ☆38 · Oct 12, 2021 · Updated 4 years ago
- Library to compare and evaluate reward functions ☆67 · Oct 23, 2023 · Updated 2 years ago
- A collection of RL algorithms written in JAX. ☆105 · Jul 5, 2022 · Updated 3 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020) ☆39 · Oct 27, 2020 · Updated 5 years ago