baturaysaglam / LA3PLinks

Actor Prioritized Experience Replay

☆17

Alternatives and similar repositories for LA3P

Users that are interested in LA3P are comparing it to the libraries listed below

Sorting:

AIDefender / MyDiscor
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆14Updated 4 years ago
x35f / meta_rl
Meta RL codebase for Unstable Baselines
☆22Updated 2 years ago
YangRui2015 / RORL
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
☆20Updated 2 years ago
Haichao-Zhang / PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
☆61Updated 2 years ago
BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆141Updated last year
YangRui2015 / Model-basedHER
Model-based Hindsight Experience Replay
☆10Updated 3 years ago
Dragon-Zhuang / BPPO
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
☆92Updated last year
polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆205Updated 11 months ago
Xingyu-Lin / mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆181Updated 3 years ago
lucaslingle / pytorch_rl2
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'
☆71Updated 3 years ago
sweetice / PEER-CVPR23
Authors' implementation of PEER
☆11Updated 2 years ago
shlee94 / Off2OnRL
☆58Updated 2 years ago
dmksjfl / MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆60Updated last year
IouJenLiu / CMAE
☆49Updated 4 years ago
MouseHu / GEM
☆14Updated 4 years ago
xtma / dsac
Distributional Soft Actor Critic
☆59Updated 5 years ago
junming-yang / mopo
Model-based Offline Policy Optimization re-implement all by pytorch
☆36Updated 2 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆174Updated last year
felix-kerkhoff / DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
☆32Updated 2 years ago
dmksjfl / DARC
Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.
☆22Updated 3 years ago
lizhuo-1994 / NECSA
Official implementation of Neural Episodic Control with State Abstraction
☆13Updated 2 years ago
TJU-DRL-LAB / self-supervised-rl
☆43Updated 3 years ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆105Updated 5 years ago
YangRui2015 / Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
☆90Updated 5 years ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆153Updated 2 years ago
kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆171Updated last year
chenf-ai / Multi-Agent-Communication-Considering-Representation-Learning
☆30Updated 2 years ago
FanmingL / ESCP
Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy
☆20Updated 3 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆52Updated 6 months ago
seolhokim / DistributedRL-Pytorch-Ray
Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)
☆27Updated 3 years ago