Model-based Offline Policy Optimization (MOPO), re-implemented entirely in PyTorch
☆42Sep 13, 2023Updated 2 years ago
Alternatives and similar repositories for mopo
Users who are interested in mopo are comparing it to the libraries listed below.
- ☆14Mar 5, 2024Updated 2 years ago
- Implementation for POET and POET-X for LLM pretraining☆34Jun 7, 2026Updated last week
- ☆10Mar 11, 2024Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆190May 17, 2022Updated 4 years ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆392May 2, 2026Updated last month
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- re-implementation of the offline model-based RL algorithm MOPO in pytorch☆26Feb 28, 2022Updated 4 years ago
- ☆35May 24, 2023Updated 3 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- ☆11Nov 18, 2023Updated 2 years ago
- ☆24May 20, 2025Updated last year
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆22Apr 17, 2024Updated 2 years ago
- [RSS'26] HoMMI: Learning Whole-Body Mobile Manipulation from Human Demonstrations☆128May 21, 2026Updated 3 weeks ago
- Re-implementations of SOTA RL algorithms.☆137Sep 7, 2023Updated 2 years ago
- Conservative Q learning in Jax☆57Feb 7, 2023Updated 3 years ago
- This repository contains the raw data used in "A Multi-Agent Reinforcement Learning Approach to Price and Comfort Optimization in HVAC-Sy…☆11Jan 14, 2022Updated 4 years ago
- Implementation of "Reinforcement Learning in Possibly Nonstationary Environments"☆10Mar 10, 2025Updated last year
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆11Feb 28, 2023Updated 3 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆33Jun 2, 2023Updated 3 years ago
- rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").☆33Dec 8, 2023Updated 2 years ago
- ☆43May 25, 2023Updated 3 years ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 7 months ago
- ☆25Feb 21, 2022Updated 4 years ago
- ☆11Mar 15, 2023Updated 3 years ago
- Implementation of bagging-based ensemble for solar irradiance prediction. Base learners used in ensemble learning is stacked-LSTM☆14Aug 28, 2020Updated 5 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning☆40Aug 17, 2022Updated 3 years ago
- Translate from: https://jax.readthedocs.io/en/latest☆52Mar 29, 2021Updated 5 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Apr 6, 2023Updated 3 years ago
- Model-Based Offline Reinforcement Learning☆51Jan 13, 2021Updated 5 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆197Dec 8, 2022Updated 3 years ago
- Flarum WeChat Login extension☆14Oct 19, 2023Updated 2 years ago
- ☆13May 21, 2023Updated 3 years ago
- ☆18Jan 3, 2020Updated 6 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- ☆16Feb 22, 2021Updated 5 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆12Dec 7, 2021Updated 4 years ago
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissions☆14Nov 23, 2025Updated 6 months ago