spideralessio / TRPO-Tensorflow2
TRPO Implementation in Tensorflow 2.0 for Reinforcement Learning Project @ Sapienza
☆16 · Updated 2 years ago
Alternatives and similar repositories for TRPO-Tensorflow2:
Users interested in TRPO-Tensorflow2 are comparing it to the repositories listed below.
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG ☆63 · Updated 5 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆118 · Updated 5 months ago
- PyTorch implementation of discrete version of Soft Actor-Critic. ☆33 · Updated 3 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆286 · Updated 4 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆122 · Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆50 · Updated 2 months ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch. ☆79 · Updated 4 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆44 · Updated 2 years ago
- PyTorch implementation of DDPG for continuous control tasks. ☆46 · Updated 5 years ago
- Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019) ☆111 · Updated 2 years ago
- 2D Gridworld navigation using RL with Hindsight Experience Replay ☆45 · Updated 5 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data. ☆104 · Updated 4 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro ☆174 · Updated 2 years ago
- Reinforcement Learning for Gym CarRacing-v0 with PyTorch ☆155 · Updated 6 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017) ☆50 · Updated 6 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment ☆105 · Updated 5 years ago
- ☆85 · Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning ☆26 · Updated 5 years ago
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3 ☆99 · Updated 2 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC ☆98 · Updated 5 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆139 · Updated 6 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆89 · Updated last year
- Deep Recurrent Attention Reinforcement Learning in Atari ☆84 · Updated 6 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆96 · Updated 4 years ago
- ☆185 · Updated 3 years ago
- Proximal Policy Optimization with Tensorflow 2.0 ☆31 · Updated 5 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings ☆48 · Updated 5 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper ☆40 · Updated 6 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic ☆40 · Updated 6 years ago
- General-Purpose Reinforcement Learning ☆17 · Updated 3 years ago