Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>
☆23Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for bmpo
Users that are interested in bmpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Oct 20, 2020Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆28Jul 19, 2023Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆555Nov 22, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NeurIPS Reproducibility Challenge 2019☆21Feb 25, 2020Updated 6 years ago
- ☆15Sep 14, 2020Updated 5 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Mar 19, 2026Updated 3 months ago
- ☆16Jun 30, 2019Updated 7 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Feb 3, 2022Updated 4 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago
- ☆18Apr 11, 2024Updated 2 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆20Dec 22, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- From simulation to real world using deep generative models☆19Sep 30, 2018Updated 7 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆158Aug 31, 2021Updated 4 years ago
- ☆92Dec 5, 2023Updated 2 years ago
- ☆17Sep 28, 2023Updated 2 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆476Jul 6, 2023Updated 2 years ago
- Efficient Exploration via State Marginal Matching (2019)☆70Jun 30, 2019Updated 7 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆47Sep 20, 2023Updated 2 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆94Sep 13, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆46Nov 22, 2022Updated 3 years ago
- Guided policy search in Python and ROS Indigo.☆26Feb 12, 2026Updated 4 months ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- ☆15Apr 5, 2023Updated 3 years ago
- MVE: model-based value estimation☆11Jul 30, 2018Updated 7 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆188Apr 12, 2022Updated 4 years ago
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆29Feb 8, 2020Updated 6 years ago
- Use tensorflow2 achieve PPO to play atari game☆13Oct 25, 2019Updated 6 years ago
- Model-Based Visual Planning with Self-Supervised Functional Distances (ICLR 2021)☆20Jul 31, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 3 years ago
- A generic tensorflow library for robotics: a bridge between robotics problem and modern machine learning architecture. Provides forward k…☆13Apr 12, 2024Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆23Nov 22, 2025Updated 7 months ago
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆27Dec 6, 2020Updated 5 years ago
- papers about reinforcement learning☆13Jan 4, 2021Updated 5 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆35Mar 6, 2021Updated 5 years ago