linhlpv / awesome-offline-to-online-RL-papers
A list of Offline to Online RL papers (continually updated)
☆22Updated last week
Related projects: ⓘ
- Author's PyTorch implementation of TD7 for online and offline RL☆108Updated last year
- Prioritized Experience Replay implementation with proportional prioritization☆67Updated last year
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆112Updated 2 years ago
- Representation Learning for RL☆110Updated last year
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆142Updated 3 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning☆68Updated last month
- Conservative Q Learning on top of SAC☆118Updated last year
- A list of papers regarding generalization in (deep) reinforcement learning☆141Updated last year
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)☆22Updated 2 months ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆101Updated last year
- ☆51Updated last year
- ☆71Updated last year
- DMControl Generalization Benchmark☆166Updated 8 months ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆68Updated 2 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆65Updated last week
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 2 years ago
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆12Updated 11 months ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆161Updated last week
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆23Updated last year
- A PyTorch implementation of Implicit Q-Learning☆66Updated 2 years ago
- ☆82Updated 8 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆78Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆108Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆24Updated 2 years ago
- Benchmarking RL generalization in an interpretable way.☆128Updated 7 months ago
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"☆61Updated 2 weeks ago
- Simple maze environments using mujoco-py☆52Updated 8 months ago
- ☆46Updated last year
- ☆45Updated 5 months ago