secury / optidice
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
☆13Updated last year
Related projects ⓘ
Alternatives and complementary repositories for optidice
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated last year
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 2 years ago
- ☆52Updated last year
- ☆13Updated last year
- ☆14Updated last year
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆17Updated last year
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆17Updated last year
- ☆17Updated 2 years ago
- ☆17Updated 6 months ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 2 years ago
- Learning from Trajectories via Subgoal Discovery☆13Updated 3 years ago
- ☆31Updated 3 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆16Updated 3 years ago
- ☆47Updated last year
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆30Updated 8 months ago
- ☆13Updated 7 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated last year
- ☆18Updated last year
- Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021)☆22Updated 3 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆18Updated 2 years ago
- ☆20Updated last year
- [NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning☆52Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last year
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Updated 2 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆19Updated last year
- ☆24Updated last year
- ☆13Updated last year
- ☆21Updated last week
- Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21☆23Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆95Updated 5 months ago