Implementation of the ICML 2020 paper "Bidirectional Model-based Policy Optimization"
☆23 · Mar 24, 2023 · Updated 3 years ago
Alternatives and similar repositories for bmpo
Users that are interested in bmpo are comparing it to the libraries listed below.
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Feb 21, 2020 · Updated 6 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆27 · Jul 19, 2023 · Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆16 · Sep 24, 2019 · Updated 6 years ago
- Code for Goal-Aware Prediction: Learning to Model what Matters ☆20 · Jul 15, 2020 · Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆537 · Nov 22, 2022 · Updated 3 years ago
- NeurIPS Reproducibility Challenge 2019 ☆21 · Feb 25, 2020 · Updated 6 years ago
- ☆399 · Jul 18, 2019 · Updated 6 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic ☆19 · Updated this week
- ☆16 · Jun 30, 2019 · Updated 6 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆27 · Feb 3, 2022 · Updated 4 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 · Aug 8, 2022 · Updated 3 years ago
- Model-Based RL Demo for Pendulum-v0 ☆13 · Jun 16, 2020 · Updated 5 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL" ☆21 · Oct 6, 2021 · Updated 4 years ago
- ☆18 · Apr 11, 2024 · Updated last year
- From simulation to real world using deep generative models ☆18 · Sep 30, 2018 · Updated 7 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination" ☆20 · Dec 22, 2021 · Updated 4 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction ☆157 · Aug 31, 2021 · Updated 4 years ago
- ☆92 · Dec 5, 2023 · Updated 2 years ago
- ☆17 · Sep 28, 2023 · Updated 2 years ago
- ☆18 · Jan 3, 2022 · Updated 4 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆472 · Jul 6, 2023 · Updated 2 years ago
- Implementation of Stochastic Gaussian Process Motion Planning algorithm, IROS 2022. ☆19 · Oct 30, 2023 · Updated 2 years ago
- Efficient Exploration via State Marginal Matching (2019) ☆69 · Jun 30, 2019 · Updated 6 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆48 · Sep 20, 2023 · Updated 2 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 · Sep 13, 2019 · Updated 6 years ago
- Reinforcement learning algorithms in RLlib ☆59 · May 3, 2024 · Updated last year
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML… ☆29 · Jan 12, 2023 · Updated 3 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors" ☆46 · Nov 22, 2022 · Updated 3 years ago
- Guided policy search in Python and ROS Indigo. ☆26 · Feb 12, 2026 · Updated last month
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling ☆28 · Dec 7, 2021 · Updated 4 years ago
- ☆15 · Apr 5, 2023 · Updated 2 years ago
- MVE: model-based value estimation ☆11 · Jul 30, 2018 · Updated 7 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆187 · Apr 12, 2022 · Updated 3 years ago
- The code used to power DeepRole ☆37 · Nov 21, 2022 · Updated 3 years ago
- PPO implemented in TensorFlow 2 to play Atari games ☆13 · Oct 25, 2019 · Updated 6 years ago
- Model-Based Visual Planning with Self-Supervised Functional Distances (ICLR 2021) ☆20 · Jul 31, 2021 · Updated 4 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation ☆29 · Feb 22, 2023 · Updated 3 years ago
- A generic tensorflow library for robotics: a bridge between robotics problem and modern machine learning architecture. Provides forward k… ☆13 · Apr 12, 2024 · Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago