vcadillog / PPO-Mario-Bros-Tensorflow-2
A modular implementation for Proximal Policy Optimization in Tensorflow 2 using Eagerly Execution for the Super Mario Bros enviroment.
☆21Updated 5 years ago
Alternatives and similar repositories for PPO-Mario-Bros-Tensorflow-2:
Users that are interested in PPO-Mario-Bros-Tensorflow-2 are comparing it to the libraries listed below
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆68Updated 8 months ago
- ☆41Updated 5 years ago
- ☆183Updated 3 years ago
- Practical tutorial on RLlib for deep hierarchical multi-agent reinforcement learning☆64Updated 2 years ago
- Proximal Policy Optimization with Tensorflow 2.0☆31Updated 5 years ago
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3☆99Updated 2 years ago
- pytorch实现的一些MARL算法☆66Updated 3 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆87Updated last year
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆174Updated 2 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆174Updated 8 months ago
- MADDPG in Ray/RLlib☆53Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆137Updated 6 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆306Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆26Updated 5 years ago
- ☆13Updated 4 years ago
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library☆218Updated 2 months ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 4 years ago
- Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]☆308Updated 2 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆78Updated 4 years ago
- Ray RLlib tutorial material☆116Updated 3 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Code for Weighted QMIX☆134Updated 4 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆152Updated last year
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆38Updated 2 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆34Updated 3 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆175Updated 6 months ago
- An RL agent for the Google Football environment☆95Updated 3 years ago