spideralessio / TRPO-Tensorflow2
TRPO Implementation in Tensorflow 2.0 for Reinforcement Learning Project @ Sapienza
☆17Updated last year
Related projects ⓘ
Alternatives and complementary repositories for TRPO-Tensorflow2
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆120Updated 3 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆173Updated last year
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆73Updated 4 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆271Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆114Updated 2 weeks ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆296Updated 2 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3☆96Updated 2 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆45Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆45Updated 4 years ago
- Proximal Policy Optimization with Tensorflow 2.0☆30Updated 5 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆140Updated 3 years ago
- ☆181Updated 2 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆99Updated 3 years ago
- Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]☆300Updated 2 years ago
- ☆41Updated 5 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆161Updated 3 months ago
- Deep recurrent Q learning on CartPole-v1 environment☆74Updated 10 months ago
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning☆21Updated 4 years ago
- The implementation of LSTM-TD3.☆64Updated last year
- Reinforcement Learning for Gym CarRacing-v0 with PyTorch☆149Updated 5 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆96Updated 5 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆133Updated 5 years ago
- scalable multi agents reinforcement learning☆54Updated 6 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆146Updated 7 months ago
- A collection of pre-trained RL agents using Stable Baselines3☆105Updated 2 weeks ago