IgnacioCarlucho / DDPG_MountainCar
The continuous mountain car problem solved with DDPG
☆13Updated 4 years ago
Related projects: ⓘ
- Collection of OpenAI parametrized action-space environments.☆55Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆44Updated 3 years ago
- Pytorch version of the MPC in model-based reinforcement learning (MBRL), currently only test in the CartPole-swing-up environment☆72Updated 4 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆34Updated 4 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 5 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆34Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic☆108Updated 6 months ago
- The implementation of LSTM-TD3.☆60Updated last year
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆22Updated 5 years ago
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning☆20Updated 3 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆110Updated 2 years ago
- The hierarchy reinforcement learning algorithm(based on DDPG)☆10Updated 5 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆92Updated 4 years ago
- A clean and robust Pytorch implementation of TD3 on continuous action space☆20Updated 3 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆87Updated 3 years ago
- PyTorch implementation of DDPG for continuous control tasks.☆41Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆94Updated 4 years ago
- ☆13Updated 4 years ago
- A curated list of awesome Model-based reinforcement learning resources☆88Updated 4 years ago
- Implementation of Soft Actor-Critic (SAC) algorithm using TensorFlow 2.1.0☆12Updated 4 years ago
- using recurrent networks(LSTM) to solve POMDPs☆33Updated 5 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆66Updated 5 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆50Updated last year
- There will be updates later☆79Updated 5 years ago
- Distributional Soft Actor Critic☆49Updated 4 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆27Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆53Updated 3 months ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆79Updated 4 years ago