alirezakazemipour / DDPG-HERLinks

Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.

☆100

Alternatives and similar repositories for DDPG-HER

Users that are interested in DDPG-HER are comparing it to the libraries listed below

Sorting:

CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
☆146Updated 3 years ago
jakegrigsby / deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
☆101Updated 3 years ago
DLR-RM / rl-trained-agents
A collection of pre-trained RL agents using Stable Baselines3
☆134Updated 9 months ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134Updated 2 weeks ago
jakegrigsby / super_sac
A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…
☆38Updated last year
araffin / rl-handson-rlvs21
Stable-Baselines3 (SB3) reinforcement learning tutorial for the Reinforcement Learning Virtual School 2021.
☆54Updated 2 years ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
rmst / rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆41Updated 3 years ago
MarcoMeter / recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
☆149Updated last year
zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆88Updated last year
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆183Updated last year
alirezakazemipour / DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
☆71Updated last year
clvrai / spirl
Official implementation of "Accelerating Reinforcement Learning with Learned Skill Priors", Pertsch et al., CoRL 2020
☆209Updated 2 years ago
schroederdewitt / multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
☆358Updated 2 years ago
montaserFath / BCO
behavior cloning from observation
☆36Updated 4 years ago
hcnoh / gail-pytorch
A simple implementation of Generative Adversarial Imitation Learning with PyTorch
☆164Updated 3 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆171Updated 8 months ago
toshikwa / gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
☆222Updated 4 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆143Updated 6 years ago
denisyarats / pytorch_sac_ae
PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
☆248Updated 5 years ago
chauncygu / Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
☆64Updated last year
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆121Updated 4 years ago
ffelten / CrazyRL
JAX and PZ RL envs + algorithms for swarms of CrazyFlies
☆79Updated 11 months ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆289Updated 4 years ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆146Updated last year
Howuhh / prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
☆81Updated 2 years ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆111Updated 4 years ago
axelbr / racecar_gym
A gym environment for a miniature racecar using the pybullet physics engine.
☆198Updated last year
alirezakazemipour / Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
☆29Updated 2 months ago
BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆136Updated last year