t6-thu / H2O
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
☆52Updated last year
Related projects ⓘ
Alternatives and complementary repositories for H2O
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆17Updated last year
- ☆14Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated last year
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation☆13Updated last year
- xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing☆14Updated last month
- The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023)☆43Updated last year
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 2 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated last year
- This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal R…☆32Updated last year
- This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning…☆33Updated 3 months ago
- ☆52Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆48Updated last year
- Implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regulariz…☆23Updated 5 months ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆39Updated 2 years ago
- ☆18Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- ☆17Updated 7 months ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆17Updated 2 years ago
- CORRO code☆34Updated 2 years ago
- D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.☆21Updated last year
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆30Updated 8 months ago
- ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models☆48Updated 7 months ago
- Implementation of the DreamerV2 agent in torch☆12Updated 2 years ago
- Open source code for paper "Learning World Models with Identifiable Factorization"☆11Updated 8 months ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆113Updated last year
- Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)☆11Updated 2 years ago
- ☆22Updated 9 months ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆40Updated 9 months ago
- ☆21Updated last week
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆22Updated 3 weeks ago