philippe-eecs / IDQL
Repo for Implicit Diffusion Q-Learning
☆100Updated last year
Alternatives and similar repositories for IDQL:
Users that are interested in IDQL are comparing it to the libraries listed below
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated 2 months ago
- Synthetic Experience Replay☆86Updated 8 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆98Updated 8 months ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆84Updated 6 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆121Updated last week
- ExORL: Exploratory Data for Offline Reinforcement Learning☆108Updated 3 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆67Updated 7 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆84Updated 2 years ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆76Updated 10 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆62Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆127Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆64Updated 8 months ago
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3☆29Updated last year
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated last year
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆104Updated last year
- Transformer-based World Models☆76Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆54Updated 10 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆37Updated 11 months ago
- ☆39Updated 3 months ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆42Updated last year
- Reinforcement Learning via Supervised Learning☆71Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆109Updated last year
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆24Updated 5 months ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆18Updated 10 months ago
- ☆70Updated 4 months ago
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS☆25Updated last year
- Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.☆77Updated 9 months ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆43Updated 11 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations☆72Updated last year