Implementation of the ICML 2020 paper <Bidirectional Model-based Policy Optimization>
☆23Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for bmpo
Users that are interested in bmpo are comparing it to the libraries listed below.
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆28Jul 19, 2023Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆548Nov 22, 2022Updated 3 years ago
- NeurIPS Reproducibility Challenge 2019☆21Feb 25, 2020Updated 6 years ago
- ☆15Sep 14, 2020Updated 5 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Mar 19, 2026Updated 2 months ago
- ☆16Jun 30, 2019Updated 6 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Feb 3, 2022Updated 4 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- Model-Based RL Demo for Pendulum-v0☆13Jun 16, 2020Updated 5 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago
- ☆18Apr 11, 2024Updated 2 years ago
- From simulation to real world using deep generative models☆18Sep 30, 2018Updated 7 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆158Aug 31, 2021Updated 4 years ago
- ☆92Dec 5, 2023Updated 2 years ago
- ☆17Sep 28, 2023Updated 2 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆475Jul 6, 2023Updated 2 years ago
- Implementation of Stochastic Gaussian Process Motion Planning algorithm, IROS 2022.☆20Oct 30, 2023Updated 2 years ago
- Efficient Exploration via State Marginal Matching (2019)☆70Jun 30, 2019Updated 6 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆46Sep 20, 2023Updated 2 years ago
- Reinforcement learning algorithms in RLlib☆59May 3, 2024Updated 2 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆46Nov 22, 2022Updated 3 years ago
- Guided policy search in Python and ROS Indigo.☆26Feb 12, 2026Updated 4 months ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- ☆15Apr 5, 2023Updated 3 years ago
- MVE: model-based value estimation☆11Jul 30, 2018Updated 7 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆188Apr 12, 2022Updated 4 years ago
- The code used to power DeepRole☆38Nov 21, 2022Updated 3 years ago
- Code for the paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆29Feb 8, 2020Updated 6 years ago
- Uses TensorFlow 2 to implement PPO for playing Atari games☆13Oct 25, 2019Updated 6 years ago
- Model-Based Visual Planning with Self-Supervised Functional Distances (ICLR 2021)☆20Jul 31, 2021Updated 4 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆23Nov 22, 2025Updated 6 months ago
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆27Dec 6, 2020Updated 5 years ago
- papers about reinforcement learning☆13Jan 4, 2021Updated 5 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆35Mar 6, 2021Updated 5 years ago