sfujim / TD7Links
Author's PyTorch implementation of TD7 for online and offline RL
β145Updated last year
Alternatives and similar repositories for TD7
Users that are interested in TD7 are comparing it to the libraries listed below
Sorting:
- π€ Elegant implementations of offline safe RL algorithms in PyTorchβ205Updated 9 months ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPOβ169Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.β168Updated 7 months ago
- π₯ Datasets and env wrappers for offline safe reinforcement learningβ94Updated 9 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and coβ¦β137Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning.β172Updated last month
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016β128Updated 10 months ago
- β102Updated 2 years ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmarβ¦β136Updated 2 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.β68Updated last year
- β271Updated 3 years ago
- A PyTorch implementation of Implicit Q-Learningβ82Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)β57Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).β87Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"β75Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memoryβ180Updated last year
- Pytorch version of Dreamer, which follows the original TF v2 codes.β126Updated 3 years ago
- π A fast safe reinforcement learning library in PyTorchβ198Updated 8 months ago
- Prioritized Experience Replay implementation with proportional prioritizationβ81Updated last year
- DSAC; Distributional Soft Actor-Criticβ129Updated 4 months ago
- Model-based Offline Policy Optimization re-implement all by pytorchβ32Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.β62Updated last year
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environmentβ46Updated 9 months ago
- Baseline implementation of recurrent PPO using truncated BPTTβ148Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuningβ98Updated 10 months ago
- β198Updated 2 years ago
- Conservative Q Learning on top of SACβ131Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.β73Updated 3 months ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.β123Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RLβ361Updated 3 years ago