IgnacioCarlucho / DDPG_MountainCar
The continuous mountain car problem solved with DDPG
☆13Updated 4 years ago
Alternatives and similar repositories for DDPG_MountainCar:
Users who are interested in DDPG_MountainCar are comparing it to the libraries listed below
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated last month
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- Using recurrent networks (LSTM) to solve POMDPs☆35Updated 6 years ago
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…☆18Updated 3 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 8 months ago
- DSAC; Distributional Soft Actor-Critic☆125Updated last month
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Collection of OpenAI parametrized action-space environments.☆64Updated last week
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆106Updated 3 years ago
- ppo-lstm-parallel☆44Updated 6 years ago
- An implementation of the policy gradient RL algorithm with PyTorch☆38Updated 4 years ago
- Implementation of Generative Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 6 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆87Updated last year
- ☆92Updated 4 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆34Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆103Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆39Updated 4 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆166Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆83Updated last year
- Adversarial Inverse Reinforcement Learning implementation for Mountain Car☆35Updated 3 years ago
- A collection of resources on model-based reinforcement learning, including code links, papers, slides, and more.☆41Updated 4 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆137Updated 6 years ago
- A library for building reinforcement learning and imitation learning agents in Pytorch☆58Updated 4 years ago
- ☆41Updated 5 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆14Updated 4 years ago
- ☆25Updated 7 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆40Updated 6 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆41Updated 8 months ago
- Safe Reinforcement Learning in Constrained Markov Decision Processes☆58Updated 4 years ago