hmomin / PPO-Winter-Run
Trains an agent with Proximal Policy Optimization (PPO) to beat Winter Run
☆21 Updated 3 years ago
Alternatives and similar repositories for PPO-Winter-Run
Users who are interested in PPO-Winter-Run are comparing it to the libraries listed below.
- ☆53 Updated 2 years ago
- A C++ pytorch implementation of MuZero ☆41 Updated last year
- AlphaZero in JAX ☆80 Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆41 Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆40 Updated 9 months ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆119 Updated last year
- AI for the game Uno ☆17 Updated 6 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game. ☆21 Updated 2 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework ☆115 Updated 4 months ago
- A curated list of reinforcement learning environments and frameworks. ☆52 Updated 6 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆33 Updated 4 years ago
- Reinforcement Learning Assembly ☆92 Updated 4 years ago
- fast + parallel AlphaZero in JAX ☆107 Updated 11 months ago
- Fictitious Self-play & Reinforcement Learning ☆18 Updated 7 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆75 Updated 2 years ago
- Gym environment for playing Wordle with RL agents ☆43 Updated 3 years ago
- Gym wrapper for pysc2 ☆10 Updated 3 years ago
- A structured implementation of MuZero ☆206 Updated 3 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft ☆46 Updated 3 years ago
- 3D Client for https://github.com/neuralmmo/environment ☆224 Updated last year
- Reinforcement learning in pure JAX. ☆13 Updated 9 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆118 Updated last year
- A2C is a special case of PPO! ☆22 Updated 3 years ago
- An OpenAI gym environment made for RL ☆71 Updated 2 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2. ☆15 Updated 5 years ago
- Code for the paper "Phasic Policy Gradient" ☆267 Updated 2 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm ☆35 Updated 5 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei… ☆35 Updated 7 years ago
- PPO implementation for OpenAI gym environment based on Unity ML Agents ☆150 Updated 7 years ago
- Baselines for gymnax 🤖 ☆73 Updated 2 years ago