vcadillog / PPO-Mario-Bros-Tensorflow-2
A modular implementation of Proximal Policy Optimization in TensorFlow 2, using eager execution, for the Super Mario Bros environment.
☆21 Updated 5 years ago
Alternatives and similar repositories for PPO-Mario-Bros-Tensorflow-2
Users who are interested in PPO-Mario-Bros-Tensorflow-2 are comparing it to the repositories listed below
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning ☆27 Updated 5 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆123 Updated 4 years ago
- PyTorch implementation of the discrete version of Soft Actor-Critic. ☆35 Updated 3 years ago
- Deep Reinforcement Learning Algorithms implemented with TensorFlow 2.3 ☆99 Updated 2 years ago
- ☆41 Updated 5 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch. ☆77 Updated 4 years ago
- Implementation of the policy gradient RL algorithm in PyTorch ☆38 Updated 4 years ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch. ☆69 Updated 3 weeks ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆178 Updated 11 months ago
- Distributional Soft Actor Critic ☆55 Updated 5 years ago
- Deep Reinforcement Learning code for study. Currently, there is only code for these algorithms: DQN, C51, QR-DQN, IQN, QUOTA. ☆213 Updated 2 years ago
- Implementations of various DQN reinforcement learning algorithms in PyTorch ☆93 Updated 4 years ago
- ☆184 Updated 3 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) ☆45 Updated 6 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis ☆29 Updated 6 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆46 Updated 9 months ago
- ☆76 Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆103 Updated 3 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch with some e… ☆52 Updated 4 years ago
- Deep recurrent Q-learning using TensorFlow, openai/gym and openai/retro ☆175 Updated 2 years ago
- There will be updates later ☆85 Updated 6 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017) ☆50 Updated 6 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆142 Updated 6 years ago
- PyTorch IMPALA implementation ☆26 Updated 5 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆96 Updated 5 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆177 Updated last year
- Practical tutorial on RLlib for deep hierarchical multi-agent reinforcement learning ☆65 Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆50 Updated last month
- ☆13 Updated 5 years ago
- Papers about reinforcement learning ☆13 Updated 4 years ago