vcadillog / PPO-Mario-Bros-Tensorflow-2
A modular implementation for Proximal Policy Optimization in Tensorflow 2 using Eagerly Execution for the Super Mario Bros enviroment.
☆21Updated 5 years ago
Alternatives and similar repositories for PPO-Mario-Bros-Tensorflow-2:
Users that are interested in PPO-Mario-Bros-Tensorflow-2 are comparing it to the libraries listed below
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3☆99Updated 2 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆122Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆96Updated 4 years ago
- Ray RLlib tutorial material☆117Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆26Updated 5 years ago
- OpenAI gym-based algorithm for the grid world problem☆28Updated 4 years ago
- ☆75Updated 5 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆37Updated 2 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆78Updated 4 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆33Updated 3 years ago
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆68Updated last week
- Practical tutorial on RLlib for deep hierarchical multi-agent reinforcement learning☆64Updated 2 years ago
- Proximal Policy Optimization with Tensorflow 2.0☆31Updated 5 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆59Updated 4 years ago
- simple code to reinforcement learning☆19Updated 4 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 6 years ago
- An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch☆45Updated 2 years ago
- ☆185Updated 3 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated 2 months ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆311Updated 3 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆125Updated 5 months ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆102Updated 2 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆174Updated 2 years ago
- The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where…☆27Updated 5 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆176Updated 9 months ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- ☆13Updated 4 years ago