hmomin / PPO-Winter-Run
Trains an agent with Proximal Policy Optimization (PPO) to beat Winter Run
☆21 Updated 3 years ago
Alternatives and similar repositories for PPO-Winter-Run
Users who are interested in PPO-Winter-Run compare it to the repositories listed below
- ☆52 Updated 2 years ago
- A C++ PyTorch implementation of MuZero ☆39 Updated last year
- AlphaZero in JAX ☆78 Updated last year
- OpenAI Gym environment for the game 2048 ☆73 Updated 3 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 Updated 4 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO ☆12 Updated 2 years ago
- PyTorch implementation of Stochastic MuZero for Gym environments. This algorithm is capable of supporting a wide range of action and obser… ☆69 Updated last year
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS) ☆261 Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆41 Updated 2 years ago
- A simple implementation of the MuZero algorithm for the Connect 4 game ☆96 Updated 5 years ago
- PySC2 OpenAI Gym Environments ☆48 Updated 6 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆117 Updated last year
- Deep reinforcement learning framework built with PyTorch ☆37 Updated 5 months ago
- Efficient baselines for autocurricula in JAX. ☆192 Updated 11 months ago
- Collection of RL Environments built using Madrona ☆35 Updated 2 years ago
- A curated list of reinforcement learning environments and frameworks. ☆51 Updated 6 years ago
- fast + parallel AlphaZero in JAX ☆97 Updated 7 months ago
- Pure Python Library for ES-HyperNEAT. Contains implementations of HyperNEAT and ES-HyperNEAT. ☆117 Updated last year
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game. ☆21 Updated 2 years ago
- Code for the paper "Phasic Policy Gradient" ☆262 Updated 2 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in JAX. ☆34 Updated last year
- Implementations of reinforcement learning algorithms for the pysc2 environment ☆89 Updated 7 years ago
- Gym wrapper for pysc2 ☆10 Updated 2 years ago
- Classic MCTS example with mctx ☆21 Updated 2 years ago
- PyLoL OpenAI Gym Environments for League of Legends v4.20 RL Environment (LoLRLE) ☆25 Updated 3 years ago
- PPO implementation for OpenAI gym environment based on Unity ML Agents ☆149 Updated 7 years ago
- A leaderboard of human and machine performance on the Arcade Learning Environment (ALE). ☆21 Updated 6 years ago
- Framework to build and train RL algorithms ☆39 Updated 3 years ago
- OpenAI Gym environment wrapper that vectorizes environments with Ray ☆23 Updated 2 years ago
- Scalable implementation of Neural Fictitious Self-Play ☆83 Updated 6 years ago