vcadillog / PPO-Mario-Bros-Tensorflow-2Links
A modular implementation for Proximal Policy Optimization in Tensorflow 2 using Eagerly Execution for the Super Mario Bros enviroment.
☆21Updated 5 years ago
Alternatives and similar repositories for PPO-Mario-Bros-Tensorflow-2
Users that are interested in PPO-Mario-Bros-Tensorflow-2 are comparing it to the libraries listed below
Sorting:
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3☆99Updated 2 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆123Updated 4 years ago
- ☆184Updated 3 years ago
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- ☆13Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆142Updated 6 years ago
- Practical tutorial on RLlib for deep hierarchical multi-agent reinforcement learning☆65Updated 2 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆175Updated 2 years ago
- Implementations of algorithms from the Q-learning family. Implementations inlcude: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51☆305Updated 5 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆130Updated 8 months ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆99Updated 5 years ago
- Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]☆311Updated 2 years ago
- Reinforcement learning tutorials☆386Updated 2 years ago
- Tensorflow 2 Reinforcement Learning Cookbook, published by Packt☆195Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆183Updated 10 months ago
- An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…☆194Updated 2 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments