sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆161 · Updated 2 years ago
Alternatives and similar repositories for TD7
Users who are interested in TD7 are comparing it to the repositories listed below.
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆176 · Updated last year
- Elegant implementations of offline safe RL algorithms in PyTorch ☆230 · Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆183 · Updated 3 years ago
- Datasets and env wrappers for offline safe reinforcement learning ☆118 · Updated 2 months ago
- Author's PyTorch implementation of ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆94 · Updated 2 years ago
- A PyTorch implementation of Implicit Q-Learning ☆93 · Updated 4 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆85 · Updated last week
- ☆308 · Updated 3 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆22 · Updated last year
- Conservative Q Learning on top of SAC ☆136 · Updated 3 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023) ☆118 · Updated last year
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆146 · Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning" contains scripts to reproduce experiments. ☆133 · Updated 4 years ago
- Datasets with baselines for Offline MARL. ☆199 · Updated 2 months ago
- ☆59 · Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆84 · Updated last year
- Benchmarked implementations of Offline RL Algorithms. ☆76 · Updated 10 months ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar… ☆143 · Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆62 · Updated 2 years ago
- Synthetic Experience Replay ☆107 · Updated last year
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022 ☆32 · Updated 2 years ago
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning ☆39 · Updated 3 years ago
- Representation Learning for RL ☆130 · Updated 2 years ago
- ☆60 · Updated last month
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆143 · Updated last year
- Benchmarking RL generalization in an interpretable way. ☆174 · Updated 2 months ago
- Code for MOPO: Model-based Offline Policy Optimization ☆191 · Updated 3 years ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat… ☆39 · Updated last year
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆79 · Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆113 · Updated last year