sail-sg / PatchAIL
Implementation of PatchAIL from the ICLR 2023 paper "Visual Imitation with Patch Rewards"
☆12 · Updated last year
Related projects:
- Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021) ☆22 · Updated 2 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code ☆15 · Updated 3 years ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation ☆13 · Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 · Updated last year
- Source files to replicate experiments in my ICLR 2022 paper. ☆59 · Updated 2 months ago
- Learning from Trajectories via Subgoal Discovery ☆13 · Updated 3 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator" ☆12 · Updated 2 years ago
- Official repository for the paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022) ☆34 · Updated 11 months ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat… ☆27 · Updated 6 months ago
- Skill-based Model-based Reinforcement Learning (CoRL 2022) ☆50 · Updated last year
- A version of the DeepMind Control Suite with randomly generated graphics, for measuring visual generalization in continuous control. ☆16 · Updated 3 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆24 · Updated last year
- Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21 ☆22 · Updated 3 years ago
- Code for the ICLR 2022 paper "Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL". ☆26 · Updated 2 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ☆28 · Updated 2 years ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR 2023) ☆17 · Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆14 · Updated 5 months ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines ☆49 · Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆87 · Updated 3 months ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL ☆19 · Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆71 · Updated 10 months ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023) ☆29 · Updated last year