Baichenjia / UTDS
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL
☆14 · Updated last year
Related projects
Alternatives and complementary repositories for UTDS
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆70 · Updated 2 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆95 · Updated 5 months ago
- Implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage-guided policy regulariz… ☆23 · Updated 5 months ago
- Official implementation of the BRO algorithm ☆10 · Updated 3 weeks ago
- D2C (Data-driven Control Library) is a library for data-driven control based on reinforcement learning. ☆21 · Updated last year
- [NeurIPS 2023] Implementation of Elastic Decision Transformer ☆29 · Updated last year
- Skill-based Model-based Reinforcement Learning (CoRL 2022) ☆52 · Updated 2 years ago
- CORRO code ☆34 · Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆116 · Updated last year
- This repository provides a survey on the applications of deep generative models for offline reinforcement learning and imitation learning… ☆33 · Updated 3 months ago
- This is the repo of "RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization" ☆92 · Updated 2 months ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar… ☆113 · Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆52 · Updated 6 months ago
- A PyTorch implementation of Implicit Q-Learning ☆66 · Updated 3 years ago
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch ☆32 · Updated 3 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆76 · Updated last year
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023) ☆17 · Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations. ☆90 · Updated last year
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆29 · Updated last year
- OGBench: Benchmarking Offline Goal-Conditioned RL ☆79 · Updated 3 weeks ago
- PyTorch implementation of the implicit Q-learning algorithm (IQL) ☆41 · Updated 2 years ago