chenran-li / RQL-release
(NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value
☆27Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for RQL-release
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated 8 months ago
- ☆20Updated last year
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆21Updated 2 months ago
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆22Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆29Updated 3 months ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆25Updated last year
- ☆22Updated 7 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆59Updated 8 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆52Updated 7 months ago
- EARL: Environment for Autonomous Reinforcement Learning☆34Updated last year
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆14Updated 6 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆24Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last year
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023☆44Updated 6 months ago
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆73Updated 6 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆80Updated last year
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆30Updated last year
- ☆18Updated 2 years ago
- Bipedal Skills Benchmark for Reinforcement Learning☆23Updated 2 years ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆30Updated this week
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆26Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated last month
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- ☆22Updated 5 months ago
- ☆13Updated last year
- ☆14Updated 8 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆72Updated 7 months ago
- PWM: Policy Learning with Large World Models☆37Updated 3 months ago
- ☆62Updated 5 months ago
- ☆55Updated last month