vcadillog / PPO-Mario-Bros-Tensorflow-2
A modular implementation for Proximal Policy Optimization in Tensorflow 2 using Eagerly Execution for the Super Mario Bros enviroment.
☆20Updated 4 years ago
Related projects: ⓘ
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3☆96Updated 2 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆71Updated 3 years ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆64Updated last month
- ☆39Updated 4 years ago
- ☆13Updated last year
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆119Updated 3 years ago
- ☆175Updated 2 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆158Updated last month
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library☆201Updated last year
- ☆41Updated 5 years ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆99Updated 7 months ago
- Reinforcement Learning for Gym CarRacing-v0 with PyTorch☆147Updated 5 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆102Updated last year
- 🍄Reinforcement Learning: Super Mario Bros with dueling dqn🍄☆102Updated 9 months ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆27Updated 3 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆152Updated this week
- Tensorflow 2 Reinforcement Learning Cookbook, published by Packt☆187Updated last year
- OpenAI gym-based algorithm for the grid world problem☆28Updated 3 years ago
- An RL agent for the Google Football environment☆90Updated 3 years ago
- Ray RLlib tutorial material☆114Updated 2 years ago
- A Reinforcement Learning Project using PPO + LSTM☆37Updated last year
- ☆46Updated 5 years ago
- Distributional Soft Actor Critic☆49Updated 4 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- ☆12Updated 3 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆92Updated 4 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆173Updated last year
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated last month
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆128Updated 5 years ago