zhuliquan / tictactoe_mcts
Play tic-tac-toe using Monte Carlo tree search
☆18Updated 7 years ago
Alternatives and similar repositories for tictactoe_mcts
Users that are interested in tictactoe_mcts are comparing it to the libraries listed below
- Course notes for Hung-yi Lee's Machine Learning, Spring 2019 (updates suspended)☆24Updated 5 years ago
- A pack of reinforcement learning algorithms.☆84Updated 3 years ago
- Source code for paper Classification with Costly Features using Deep Reinforcement Learning.☆56Updated 3 years ago
- Tutorial for Graph AutoEncoder implemented by TensorFlow 1.x and 2.x☆42Updated 4 years ago
- Context-Aware Multi-Modal Transportation Recommendation☆38Updated 6 years ago
- Source codes for the book "Application of Neural Network and PyTorch"☆155Updated 2 years ago
- A translation of Reinforcement Learning: An Introduction☆114Updated 7 years ago
- Integrating causal thinking into machine learning☆81Updated 5 years ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆62Updated 11 months ago
- Reproducing Shalit et al.'s Individual Treatment Effect model. This is a deep neural net that can be applied to various problems in causa…☆18Updated 3 years ago
- Python implementation of Spectral Clustering.☆67Updated 7 years ago
- Chinese Translation for Book 《Reinforcement Learning- An Introduction》-Second Edition☆125Updated 6 years ago
- ML Records in 1110 Lab of BUPT. Some detailed information can be referenced on: https://mathpretty.com/10388.html☆235Updated 2 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising☆26Updated 5 years ago
- ☆65Updated 3 years ago
- reinforcement learning☆38Updated 7 years ago
- This is a PyTorch implementation of the GeniePath model in <GeniePath: Graph Neural Networks with Adaptive Receptive Paths> (https://arxi…☆105Updated last year
- Chinese lecture notes for Stanford CS234 Reinforcement Learning☆205Updated 4 years ago
- Temporal IMLinUCB - a solution for Online Influence Maximization problem in Temporal Networks (based on IMLinUCB)☆16Updated last year
- Graph Convolutional Networks, Graph Attention Networks, Gated Graph Neural Net, Mixhop☆31Updated 5 years ago
- ☆25Updated 4 years ago
- A PyTorch-based Gomoku game model, built on AlphaZero-style reinforcement learning and Monte Carlo Tree Search.☆166Updated 6 years ago
- Here are some Python implementations of Gomoku AIs, including MCTS, Minimax and Genetic Alg.☆32Updated 6 years ago
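- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and exercises from the book 《深入浅出强化学习原理入门》☆265Updated 6 years ago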
- Implementation of Learning Combinatorial Optimization Algorithms over Graphs, by Hanjun Dai et al. (2017)☆33Updated 7 years ago
- bilinear graph neural network☆30Updated 5 years ago
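- This project provides a visually configurable reinforcement learning framework built around AgentRL, so that you can get started with AgentRL programming in 30 minutes. Features related to AgentRL, local agents, MCP, and A2A will be added later.☆76Updated last month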
- Course Materials for ML Course at Tsinghua☆26Updated 5 years ago
- Graph-based Reinforcement Learning☆16Updated 7 years ago
- Informal notes on deep learning and NLP☆26Updated 6 years ago