linhlpv / awesome-offline-to-online-RL-papers
A list of Offline to Online RL papers (continually updated)
★34, updated 4 months ago
Alternatives and similar repositories for awesome-offline-to-online-RL-papers:
Users interested in awesome-offline-to-online-RL-papers are comparing it to the repositories listed below:
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ★83, updated 6 months ago
- Conservative Q Learning on top of SAC ★122, updated 2 years ago
- Datasets and env wrappers for offline safe reinforcement learning ★84, updated 4 months ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ★57, updated last year
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ★45, updated last year
- Transformer-based World Models ★75, updated last year
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022 ★26, updated last year
- Representation Learning for RL ★122, updated last year
- Synthetic Experience Replay ★84, updated 8 months ago
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ★83, updated this week
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning ★21, updated last month
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch) ★25, updated 3 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ★98, updated 8 months ago
- ★20, updated 9 months ago
- Prioritized Experience Replay implementation with proportional prioritization ★76, updated last year
- ★29, updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ★52, updated 9 months ago
- ★50, updated 10 months ago
- ★24, updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ★64, updated 7 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ★124, updated last year
- A PyTorch implementation of Implicit Q-Learning ★71, updated 3 years ago
- ★57, updated 2 months ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer ★33, updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ★76, updated last month
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning", containing scripts to reproduce experiments. ★118, updated 3 years ago
- xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing ★15, updated 3 months ago
- [NeurIPS 2023] Efficient Diffusion Policy ★91, updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR 2023) ★51, updated last year
- ★37, updated last year