moonbings / connect6_rl
Connect6 AI based on reinforcement learning
☆12Updated 5 years ago
Related projects
Alternatives and complementary repositories for connect6_rl
- Minimal version of DeepMind AlphaZero☆81Updated 3 years ago
- ☆40Updated 4 years ago
- ☆67Updated last year
- ☆10Updated 6 years ago
- OpenAI Gym Env for the game Gomoku (Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang, ...)☆85Updated last month
- ☆49Updated 5 years ago
- RLOpensource / IMPALA-Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures☆36Updated 5 years ago
- Implementations of basic reinforcement learning algorithms☆117Updated 6 years ago
- Tic Tac Toe with Alpha Zero method - My first work☆16Updated 6 years ago
- Machine learning project using DeepMind's PySC2☆12Updated 7 years ago
- A repository of deep reinforcement learning implementations from lectures given at Samsung☆106Updated 3 years ago
- Implementation of distributed reinforcement learning with Distributed TensorFlow☆57Updated 3 years ago
- weekly reinforcement learning paper reviews☆31Updated 6 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Updated 6 years ago
- AlphaZero training framework for game Connect6 written in Rust with C++, Python interface.☆17Updated 5 years ago
- Reinforcement Learning Tutorial on Super Mario☆90Updated 7 years ago
- An OpenAI Gym interface to The Legend of Zelda on the NES.☆24Updated 4 years ago
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste…☆88Updated 6 years ago
- The exact code used by team "liveinparis" in the Kaggle football competition, ranked 6th/1141☆56Updated 3 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX☆68Updated 3 years ago
- OpenAI Gym Style Tic-Tac-Toe Environment☆69Updated 3 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆26Updated last year
- Repository for studying distributional RL☆30Updated 5 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆189Updated 4 years ago
- ☆57Updated 5 years ago
- Distributed Prioritized Experience Replay☆10Updated 6 years ago
- ☆69Updated 5 years ago
- PyTorch Implementation of Distributed Prioritized Experience Replay(Ape-X)☆153Updated 5 years ago
- A tutorial on building a Super Mario agent using reinforcement learning.☆54Updated 6 years ago