BepfCp / RL-pytorch

A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.

☆21

Related projects: ⓘ

ZhengYinan-AIR / OMIGA
[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…
☆27Updated 6 months ago
LAMDA-RL / ODIS
The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".
☆31Updated last month
DrZero0 / MACC
The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation".
☆15Updated 2 years ago
chenf-ai / Multi-Agent-Communication-Considering-Representation-Learning
☆28Updated last year
yihaosun1124 / mobile
Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆14Updated 5 months ago
junming-yang / mopo
Model-based Offline Policy Optimization re-implement all by pytorch
☆27Updated last year
ryanxhr / DWBC
[ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
☆34Updated last year
chauncygu / Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
☆45Updated 3 months ago
danielshin1 / oprl
Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning
☆19Updated last year
FanmingL / Recurrent-Offpolicy-RL
Implementation of SAC and TD3 based on various RNN and Transformer.
☆10Updated 3 weeks ago
bkkgbkjb / OPPO
☆20Updated this week
YangRui2015 / RORL
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
☆17Updated last year
YangRui2015 / RIQL
Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"
☆11Updated 8 months ago
LAMDA-RL / PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆14Updated 10 months ago
Baichenjia / PBRL
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆28Updated 2 years ago
liuzuxin / DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
☆65Updated this week
FanmingL / ESCP
Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy
☆20Updated 2 years ago
lafmdp / HIDIL
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12Updated 2 years ago
mantle2048 / rlplot
rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").
☆26Updated 9 months ago
typoverflow / OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
☆62Updated 4 months ago
PKU-RL / CORRO
CORRO code
☆33Updated 2 years ago
FanmingL / SmartLogger
☆11Updated 4 months ago
shlee94 / Off2OnRL
☆51Updated last year
tianxusky / Code-for-Error-Bounds-of-Imitating-Policies-and-Environments
☆10Updated 3 years ago
ryanxhr / IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆41Updated last year
Dragon-Zhuang / BPPO
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
☆70Updated 9 months ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆108Updated last year
dmksjfl / MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆49Updated 4 months ago
MouseHu / GEM
☆12Updated 3 years ago
ruizhaogit / maximum_entropy_population_based_training
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
☆25Updated last year