sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
β141Updated last year
Alternatives and similar repositories for TD7:
Users that are interested in TD7 are comparing it to the libraries listed below
- π₯ Datasets and env wrappers for offline safe reinforcement learningβ88Updated 7 months ago
- π€ Elegant implementations of offline safe RL algorithms in PyTorchβ196Updated 7 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.β166Updated 5 months ago
- β95Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"β69Updated 10 months ago
- Datasets with baselines for offline multi-agent reinforcement learning.β162Updated last week
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.β61Updated 10 months ago
- A PyTorch implementation of Implicit Q-Learningβ80Updated 3 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.β65Updated last year
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPOβ167Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTTβ139Updated 11 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memoryβ174Updated 10 months ago
- Synthetic Experience Replayβ91Updated 10 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and coβ¦β135Updated 11 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.β66Updated last year
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmarβ¦β124Updated last year
- π A fast safe reinforcement learning library in PyTorchβ183Updated 6 months ago
- β55Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).β82Updated last year
- Benchmarked implementations of Offline RL Algorithms.β73Updated last month
- β264Updated 3 years ago
- Model-based Offline Policy Optimization re-implement all by pytorchβ31Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)β52Updated 2 years ago
- Prioritized Experience Replay implementation with proportional prioritizationβ77Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016β126Updated 8 months ago
- Pytorch version of Dreamer, which follows the original TF v2 codes.β124Updated 3 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.β121Updated 3 years ago
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"β68Updated 2 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learningβ161Updated 9 months ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuningβ92Updated 8 months ago