LQNew / Deeper_Larger_Actor-Critic_RL

Pytorch implementation of large network design in continous control RL.

☆19

Alternatives and similar repositories for Deeper_Larger_Actor-Critic_RL:

Users that are interested in Deeper_Larger_Actor-Critic_RL are comparing it to the libraries listed below

LQNew / Continuous_Control_Benchmark
Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.
☆28Updated 4 years ago
chauncygu / Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
☆61Updated 10 months ago
dmksjfl / DARC
Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.
☆21Updated 3 years ago
Stepan-Makarenko / ICM-PPO-implementation
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML
☆15Updated last year
apexrl / GCRL-Collection
This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…
☆124Updated last year
TobiasLv / RAD
☆39Updated 3 weeks ago
martius-lab / HiTS
Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021
☆33Updated 2 years ago
Mee321 / policy-distillation
☆14Updated 5 years ago
yangtao121 / AquaRL
☆16Updated 2 years ago
dmksjfl / MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆55Updated 11 months ago
hari-sikchi / AWAC
Advantage weighted Actor Critic for Offline RL
☆50Updated 2 years ago
MDrW / ICML2022-IRAT
☆39Updated 2 years ago
shlee94 / Off2OnRL
☆55Updated 2 years ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆107Updated 3 years ago
alirezakazemipour / DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
☆66Updated last year
fiberleif / POfD
Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.
☆14Updated 5 years ago
PKU-RL / CORRO
CORRO code
☆35Updated 2 years ago
lich14 / CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
☆85Updated 2 years ago
trzhang0116 / HRAC
PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020 spot…
☆40Updated last year
montaserFath / BCO
behavior cloning from observation
☆35Updated 4 years ago
SvenGronauer / Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
☆67Updated last year
LinghengMeng / LSTM-TD3
The implementation of LSTM-TD3.
☆79Updated 2 years ago
hzm2016 / option-critic-pytorch
☆12Updated 2 years ago
DesikRengarajan / LOGO
[ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration
☆26Updated 3 years ago
AIDefender / MyDiscor
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆13Updated 3 years ago
LQNew / LWDRLC
Lightweight deep RL Libraray for continuous control.
☆16Updated 3 years ago
Haichao-Zhang / PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
☆53Updated 2 years ago
alirezakazemipour / Discrete-SAC-PyTorch
PyTorch implementation of discrete version of Soft Actor-Critic.
☆33Updated 3 years ago
Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆50Updated 2 months ago
xtma / dsac
Distributional Soft Actor Critic
☆52Updated 4 years ago