YaoMarkMu / DOMINO_MB-MetaRL
☆20Updated last year
Related projects: ⓘ
- [NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning☆51Updated 11 months ago
- ☆20Updated 11 months ago
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners"☆45Updated 10 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆37Updated 5 months ago
- ☆23Updated last year
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation☆13Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆43Updated 6 months ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆17Updated last year
- ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models☆40Updated 5 months ago
- official implementation of ODICE☆13Updated 7 months ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆36Updated 7 months ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated last year
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆27Updated 6 months ago
- ☆24Updated last year
- ☆46Updated last year
- rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").☆26Updated 9 months ago
- Implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regulariz…☆20Updated 3 months ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆104Updated last year
- curriculum☆19Updated last year
- CORRO code☆33Updated 2 years ago
- ☆14Updated 10 months ago
- [ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"☆12Updated last year
- ☆71Updated last year
- ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Man…☆15Updated last year
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆17Updated 2 years ago
- ☆11Updated 8 months ago
- ☆20Updated this week
- Implantation of CtrlFormer☆27Updated last year
- ☆11Updated last year
- This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal R…☆30Updated last year