sfujim / TD7Links
Author's PyTorch implementation of TD7 for online and offline RL
β151Updated 2 years ago
Alternatives and similar repositories for TD7
Users that are interested in TD7 are comparing it to the libraries listed below
Sorting:
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.β173Updated 11 months ago
- π₯ Datasets and env wrappers for offline safe reinforcement learningβ107Updated last year
- π€ Elegant implementations of offline safe RL algorithms in PyTorchβ216Updated last year
- A PyTorch implementation of Implicit Q-Learningβ91Updated 4 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPOβ179Updated 3 years ago
- β287Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).β90Updated last year
- Conservative Q Learning on top of SACβ132Updated 3 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.β77Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimizationβ22Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)β110Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)β60Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and coβ¦β143Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.β131Updated 3 years ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmarβ¦β143Updated 2 years ago
- β55Updated 3 months ago
- Representation Learning for RLβ127Updated 2 years ago
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learningβ36Updated 3 years ago
- Datasets with baselines for Offline MARL.β181Updated 2 months ago
- β57Updated 2 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023β33Updated 10 months ago
- Benchmarked implementations of Offline RL Algorithms.β74Updated 7 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"β79Updated last year
- A collection of offline reinforcement learning algorithms.β201Updated 11 months ago
- Code for MOPO: Model-based Offline Policy Optimizationβ188Updated 3 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.β66Updated last year
- Synthetic Experience Replayβ103Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"β57Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)β77Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016β137Updated last year