cookbenjamin / DDPG
Clean Python Implementation of the Deep Deterministic Policy Gradients Algorithm
☆73Updated 8 years ago
Alternatives and similar repositories for DDPG:
Users that are interested in DDPG are comparing it to the libraries listed below
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆115Updated 2 years ago
- ☆40Updated 3 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- ☆75Updated 5 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆286Updated 4 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆33Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆88Updated last year
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning☆94Updated last year
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆207Updated 5 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆59Updated 4 years ago
- Experiments with reinforcement learning and recurrent neural networks☆113Updated last year
- ☆185Updated 3 years ago
- Multi-Objective Reinforcement Learning☆269Updated 3 years ago
- Deep Q-learning (DQN) for Multi-agent Reinforcement Learning (RL)☆338Updated 4 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆51Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆67Updated 10 months ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆152Updated last year
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆169Updated last year
- Value-Decomposition Networks For Cooperative Multi-Agent Learning☆22Updated 4 years ago
- PyTorch implementation of SAC-Discrete.☆304Updated 9 months ago
- Actor-Critic and openAI clipped PPO in gym cartpole-v0 and pendulum-v0 environment☆25Updated 4 years ago
- The code for maddpg using pytorch☆167Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆312Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆143Updated 10 months ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆174Updated 2 years ago
- ☆41Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆140Updated 6 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆82Updated 7 years ago