BY571 / Implicit-Q-Learning
PyTorch implementation of the implicit Q-learning algorithm (IQL)
☆41 Updated 2 years ago
Related projects
Alternatives and complementary repositories for Implicit-Q-Learning
- A PyTorch implementation of Implicit Q-Learning ☆66 Updated 3 years ago
- Conservative Q Learning on top of SAC ☆119 Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆114 Updated last year
- ☆51 Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆60 Updated last year
- CORRO code ☆34 Updated 2 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆52 Updated 6 months ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch. ☆22 Updated last week
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆154 Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆122 Updated 6 months ago
- ☆28 Updated 2 years ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments. ☆114 Updated 3 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment ☆81 Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆36 Updated 4 years ago
- Author's PyTorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO). ☆72 Updated 10 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆150 Updated 2 weeks ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆92 Updated 3 years ago
- Advantage weighted Actor Critic for Offline RL ☆47 Updated 2 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning ☆19 Updated last year
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning" ☆13 Updated 3 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ☆28 Updated 2 years ago
- ☆37 Updated 2 years ago
- ☆10 Updated 4 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆50 Updated 4 months ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat… ☆28 Updated 8 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆47 Updated last year
- Simple maze environments using mujoco-py ☆52 Updated 10 months ago
- A collection of offline reinforcement learning algorithms. ☆158 Updated 5 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆94 Updated 5 months ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 Updated last year