GilgameshD / GRADER
This is the official implementation of the NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning".
☆33Updated 2 years ago
Alternatives and similar repositories for GRADER
Users who are interested in GRADER are comparing it to the libraries listed below.
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti…☆42Updated 3 years ago
- ☆56Updated 2 years ago
- ☆39Updated 3 years ago
- ☆31Updated 2 years ago
- ☆18Updated 2 years ago
- ☆47Updated 6 months ago
- ☆19Updated last year
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆45Updated 3 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated 2 years ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆34Updated 2 years ago
- ☆26Updated 4 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- ☆33Updated 2 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆33Updated 2 years ago
- ☆15Updated last year
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆27Updated 2 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆39Updated 2 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆30Updated 4 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆84Updated 6 months ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Updated 2 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Updated 2 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆25Updated last year
- This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents.☆19Updated 7 months ago
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆35Updated last year
- behavior cloning from observation☆34Updated 4 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated last year
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"☆29Updated last year
- Official implementation of NeurIPS'23 paper, Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets☆26Updated last year
- ☆48Updated last year