sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆157 · Updated 2 years ago
Alternatives and similar repositories for TD7
Users who are interested in TD7 are comparing it to the repositories listed below.
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆111 · Updated last month
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆225 · Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆182 · Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆176 · Updated last year
- A PyTorch implementation of Implicit Q-Learning ☆93 · Updated 4 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆84 · Updated 2 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆93 · Updated 2 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization ☆22 · Updated last year
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023) ☆115 · Updated last year
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar… ☆143 · Updated 2 years ago
- ☆303 · Updated 3 years ago
- Conservative Q Learning on top of SAC ☆132 · Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆61 · Updated 2 years ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments. ☆132 · Updated 4 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆82 · Updated last year
- Benchmarked implementations of Offline RL Algorithms. ☆76 · Updated 9 months ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆79 · Updated 3 years ago
- Synthetic Experience Replay ☆107 · Updated last year
- ☆59 · Updated 2 weeks ago
- Representation Learning for RL ☆129 · Updated 2 years ago
- ☆58 · Updated 2 years ago
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning ☆37 · Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆190 · Updated 3 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022 ☆32 · Updated 2 years ago
- Simple maze environments using mujoco-py ☆57 · Updated last year
- Datasets with baselines for Offline MARL. ☆193 · Updated last month
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat… ☆39 · Updated last year
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch ☆35 · Updated 4 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆70 · Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆112 · Updated last year