x35f / unstable_baselines

Re-implementations of SOTA RL algorithms.

☆134

Alternatives and similar repositories for unstable_baselines

Users who are interested in unstable_baselines compare it to the libraries listed below.

Xingyu-Lin / mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
☆175 Updated 3 years ago
LAMDA-RL / OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
☆75 Updated 5 months ago
liyc-ai / RL-pytorch
A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.
☆26 Updated last week
xionghuichen / RLAssistant
RLA is a tool for managing your RL experiments automatically
☆71 Updated 2 years ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆146 Updated last year
apexrl / Batch-Offline--RL-Paper-Lists
Paper Collection for Batch RL with brief introductions.
☆84 Updated 3 years ago
polixir / NeoRL
Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets
☆124 Updated 8 months ago
snu-mllab / EDAC
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
☆75 Updated 2 years ago
liuzuxin / OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
☆207 Updated 10 months ago
ryanxhr / POR
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
☆57 Updated 2 years ago
Dragon-Zhuang / BPPO
Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).
☆87 Updated last year
ReinholdM / Offline-Pre-trained-Multi-Agent-Decision-Transformer
☆112 Updated 2 years ago
instadeepai / og-marl
Datasets with baselines for Offline MARL.
☆176 Updated last week
polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆191 Updated 8 months ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆182 Updated 3 years ago
typoverflow / UtilsRL
A Python module designed for agile RL algorithm development.
☆26 Updated last year
gwthomas / IQL-PyTorch
A PyTorch implementation of Implicit Q-Learning
☆83 Updated 3 years ago
dmksjfl / MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆58 Updated last year
BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆136 Updated last year
yihaosun1124 / mobile
Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆19 Updated last year
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆171 Updated 8 months ago
rll-research / BPref
Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments.
☆129 Updated 3 years ago
jiangsy / mbpo_pytorch
☆29 Updated 3 years ago
young-geng / CQL
Conservative Q Learning on top of SAC
☆132 Updated 2 years ago
FanmingL / ESCP
Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy
☆20 Updated 3 years ago
xionghuichen / MAPLE
The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)
☆25 Updated last year
x35f / meta_rl
Meta RL codebase for Unstable Baselines
☆21 Updated 2 years ago
alirezakazemipour / DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
☆71 Updated last year
microsoft / ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …
☆70 Updated 2 years ago
twni2016 / pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆329 Updated 11 months ago