tinkoff-ai / CORLLinks

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

☆1,223

Alternatives and similar repositories for CORL

Users that are interested in CORL are comparing it to the libraries listed below

Sorting:

corl-team / CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆559Updated last year
Farama-Foundation / D4RL
A collection of reference environments for offline reinforcement learning
☆1,510Updated 7 months ago
takuseno / d3rlpy
An offline deep reinforcement learning library
☆1,497Updated last month
hanjuku-kaso / awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
☆989Updated last year
yihaosun1124 / OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
☆345Updated last year
pranz24 / pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
☆885Updated 3 years ago
quantumiracle / Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…
☆1,256Updated 3 months ago
ikostrikov / jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
☆686Updated 2 years ago
denisyarats / pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
☆552Updated 3 years ago
aviralkumar2907 / CQL
Code for conservative Q-learning
☆446Updated 3 years ago
facebookresearch / mbrl-lib
Library for Model Based RL
☆1,003Updated 11 months ago
Stable-Baselines-Team / stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
☆609Updated last week
vwxyzjn / ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
☆792Updated last year
araffin / rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
☆680Updated 2 years ago
rail-berkeley / softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…
☆1,310Updated last year
HumanCompatibleAI / imitation
Clean PyTorch implementations of imitation and reward learning algorithms
☆1,524Updated 5 months ago
clvrai / awesome-rl-envs
☆1,197Updated last year
chauncygu / Safe-Reinforcement-Learning-Baselines
The repository is for safe reinforcement learning baselines.
☆654Updated 2 months ago
Farama-Foundation / Minari
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
☆401Updated 2 weeks ago
sfujim / BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆632Updated 4 years ago
araffin / sbx
SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms
☆467Updated this week
google-research / rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
☆834Updated 10 months ago
Stable-Baselines-Team / stable-baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
☆301Updated 2 years ago
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆361Updated 3 years ago
nikhilbarhate99 / min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…
☆275Updated 3 years ago
uoe-agents / epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
☆603Updated 9 months ago
Kaixhin / imitation-learning
Imitation learning algorithms
☆538Updated 3 months ago
google-research / batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
☆551Updated 2 years ago
PKU-Alignment / safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
☆465Updated 4 months ago
sfujim / TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
☆1,893Updated last year