xkiwilabs / DQN-using-PyTorch-and-ML-AgentsLinks
A simple example of how to implement vector based DQN using PyTorch and a ML-Agents environment
☆93Updated 6 years ago
Alternatives and similar repositories for DQN-using-PyTorch-and-ML-Agents
Users that are interested in DQN-using-PyTorch-and-ML-Agents are comparing it to the libraries listed below
Sorting:
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆175Updated 2 years ago
- A pytorch tutorial for DRL(Deep Reinforcement Learning)☆218Updated 2 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Updated 4 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆142Updated 2 years ago
- Lightweight multi-agent gridworld Gym environment☆210Updated 2 years ago
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆299Updated last year
- Gridworld environments for OpenAI gym.☆79Updated last year
- Proximal Policy Optimization implementation with TensorFlow☆107Updated 7 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆106Updated 6 years ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆76Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆147Updated 6 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆326Updated 3 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆100Updated 6 years ago
- ☆91Updated 4 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Updated 6 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆352Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Updated 4 years ago
- Gridworld for MARL experiments☆141Updated 4 years ago
- An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…☆194Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆84Updated 4 years ago
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆55Updated 6 years ago
- Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action☆115Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆73Updated 8 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 2 years ago
- ☆77Updated 7 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆67Updated 5 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆105Updated 5 years ago
- Random Network Distillation pytorch☆254Updated 6 years ago