IgnacioCarlucho / DDPG_MountainCarLinks

The continuous mountain car problem solved with DDPG

☆13

Alternatives and similar repositories for DDPG_MountainCar

Users that are interested in DDPG_MountainCar are comparing it to the libraries listed below

Sorting:

BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆288Updated 4 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆126Updated 5 years ago
jsztompka / MultiAgent-PPO
Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis
☆29Updated 6 years ago
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆121Updated 4 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆143Updated 6 years ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆318Updated 3 years ago
Bigpig4396 / PyTorch-Deep-Recurrent-Q-Learning-DRQN
☆41Updated 6 years ago
nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
☆106Updated 6 years ago
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129Updated 5 months ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
thomashirtz / gym-hybrid
Collection of OpenAI parametrized action-space environments.
☆65Updated 4 months ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆109Updated 4 years ago
ovechkin-dm / ppo-lstm-parallel
ppo-lstm-parallel
☆46Updated 6 years ago
namidairo777 / Distributed-MADDPG
Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.
☆106Updated 4 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆96Updated 5 years ago
createamind / DRL
☆92Updated 4 years ago
LinghengMeng / LSTM-TD3
The implementation of LSTM-TD3.
☆82Updated 2 years ago
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆180Updated last year
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134Updated last week
keep9oing / DRQN-Pytorch-CartPole-v1
Deep recurrent Q learning on CartPole-v1 environment
☆92Updated last year
HaiyinPiao / pytorch-a2clstm-DRQN
using recurrent networks(LSTM) to solve POMDPs
☆35Updated 6 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114Updated last year
navuboy / gail_gym
Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
☆89Updated 6 years ago
marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆175Updated 2 years ago
TianhongDai / distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
☆62Updated 6 years ago
deligentfool / dqn_zoo
The implement of all kinds of dqn reinforcement learning with Pytorch
☆93Updated 4 years ago
deligentfool / policy_based_RL
The implement of the policy gradient RL algorithm with pytorch
☆39Updated 4 years ago
cardwing / Codes-for-RL-PER
A novel DDPG method with prioritized experience replay (IEEE SMC 2017)
☆50Updated 6 years ago
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆307Updated last year
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆99Updated 6 years ago