pekaalto / DQN
Deep-Q-Network reinforcement learning algorithm applied to a simple 2d-car-racing environment
☆48Updated 7 years ago
Related projects: ⓘ
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆110Updated 2 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 5 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 6 years ago
- ☆91Updated 3 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆173Updated last year
- Using PyTorch and DDPG to play Torcs☆51Updated 5 years ago
- Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action☆111Updated 5 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated last year
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆129Updated last year
- Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.☆195Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆92Updated 2 years ago
- Implementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinf…☆176Updated 2 years ago
- PyTorch implementation of CommNet☆36Updated 6 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆48Updated 3 years ago
- Safe Reinforcement Learning algorithms☆69Updated 2 years ago
- Implementation of Linear Inverse Reinforcement Learning Algorithm (IRL) on Mountain Car Environment.☆29Updated 4 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- Simple grid-world environment compatible with OpenAI-gym☆49Updated 4 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 4 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆98Updated 3 years ago
- ☆15Updated 4 years ago
- Tensorflow implementation of the asynchronous advantage actor-critic (a3c) reinforcement learning algorithm for continuous action space☆46Updated 6 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆119Updated 3 years ago
- ☆76Updated 6 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- ☆80Updated 5 years ago
- Code for hierarchical imitation learning and reinforcement learning☆282Updated 6 years ago
- Reinforcement learning algorithms A2C, A3C and DQN☆14Updated 11 months ago