zhuliquan / tictactoe_mctsLinks
使用蒙特卡洛树搜索玩tietactoe游戏
☆18Updated 6 years ago
Alternatives and similar repositories for tictactoe_mcts
Users that are interested in tictactoe_mcts are comparing it to the libraries listed below
Sorting:
- An Python N-in-Row game based on Monte Carlo Tree Search and UCT RAVE☆51Updated 7 years ago
- 大数据金融课程final☆13Updated 5 years ago
- 主要存储Datawhale组队学习中“强化学习”方向的资料。☆33Updated 4 years ago
- ☆25Updated 3 years ago
- ☆13Updated last year
- Solutions for CS294-112 Fall2018 assignments in Pytorch☆20Updated 6 years ago
- Chinese Translation for Book 《Reinforcement Learning- An Introduction》-Second Edition☆124Updated 6 years ago
- ☆17Updated 6 years ago
- A pack of reinforcement learning algorithms.☆84Updated 3 years ago
- Frequent Pattern Mining☆36Updated 6 years ago
- A translation of Reinforcement Learning: An Introduction☆114Updated 6 years ago
- 深度学习和NLP 随笔☆26Updated 6 years ago
- Implementation to VirtualTaobao☆12Updated 5 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆165Updated 6 years ago
- paper list in the area of reinforcenment learning for recommendation systems☆24Updated 4 years ago
- Implementation for our paper in NeurIPS 2019☆48Updated 5 years ago
- Douban Movies Recommendation based on NeuralFM with Tensorflow. This is a project in Social Network Analysis@FDU.☆29Updated 6 years ago
- A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.☆72Updated 5 years ago
- A Distribution-Free Test of Independence Based on Mean Variance Index.☆24Updated last year
- 我的强化学习笔记和学习材料 still updating ... ...☆348Updated 6 years ago
- 《最优化导论》第1 2 3 4 5 6 7 8 9 10 11 13 20 21 22 23章LaTeX公式笔记☆41Updated 6 years ago
- Course Materials for ML Course at Tsinghua☆25Updated 5 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising☆26Updated 4 years ago
- 教材 Causal Inference: What if 的编译和解读!☆23Updated 5 years ago
- Tensorflow implementation for "Generative Adversarial User Model forReinforcement Learning Based Recommendation System"