xkiwilabs / DQN-using-PyTorch-and-ML-Agents
A simple example of how to implement vector based DQN using PyTorch and a ML-Agents environment
☆91Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for DQN-using-PyTorch-and-ML-Agents
- Curiosity-driven Exploration by Self-supervised Prediction☆134Updated last year
- Proximal policy optimization in PyTorch. Easy to read and understand.☆49Updated 4 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆133Updated 5 years ago
- Pytorch implementation of distributed deep reinforcement learning☆74Updated 2 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆253Updated 4 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆120Updated 3 years ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆73Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆296Updated 2 years ago
- Proximal Policy Optimization implementation with TensorFlow☆104Updated 6 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆305Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆225Updated 4 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper☆38Updated 5 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆161Updated 3 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆192Updated 2 years ago
- Gridworld for MARL experiments☆137Updated 3 years ago
- Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)☆215Updated 2 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆96Updated 5 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 5 years ago
- ☆91Updated 3 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆132Updated 3 months ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆271Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆117Updated 3 months ago
- Solving POMDP using Recurrent networks☆82Updated 4 years ago
- Gridworld environments for OpenAI gym.☆80Updated 9 months ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 3 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆173Updated last year
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆81Updated 3 years ago
- ☆190Updated last year