EricSteinberger / DREAMLinks
Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games
☆119Updated last year
Alternatives and similar repositories for DREAM
Users that are interested in DREAM are comparing it to the libraries listed below
Sorting:
- Scalable Implementation of Neural Fictitous Self-Play☆85Updated 7 years ago
- Scalable Implementation of Deep CFR and Single Deep CFR☆314Updated 5 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆181Updated 6 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Updated 4 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆129Updated 2 years ago
- Pytorch Implementation of MuZero☆352Updated 2 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Updated 2 years ago
- A structured implementation of MuZero☆206Updated 3 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆103Updated 3 years ago
- ☆66Updated 4 years ago
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 5 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆206Updated 3 years ago
- Keeping track of RL experiments☆166Updated 3 years ago
- ☆327Updated last year
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆37Updated last year
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Updated 4 months ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Updated 4 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Updated 3 years ago
- OpenAI Gym wrapper for ViZDoom enviroments☆70Updated 4 years ago
- Clone of OpenAI's Spinning Up in PyTorch☆156Updated 3 years ago
- Qiita投稿用に作成したAgent57(強化学習)の実装コードです。☆45Updated 2 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆48Updated 7 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Updated 5 years ago
- Code for the paper "Phasic Policy Gradient"☆267Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆122Updated 4 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆35Updated 7 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆188Updated last year
- An environment of the board game Go using OpenAI's Gym API☆177Updated 3 years ago
- Modular framework for Reinforcement Learning in python☆183Updated 3 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Updated 7 years ago