yangrc1234 / Gomoku-Zero
A gomoku AI based on Alpha Zero paper.
☆12Updated 2 years ago
Alternatives and similar repositories for Gomoku-Zero:
Users that are interested in Gomoku-Zero are comparing it to the libraries listed below
- Deep Learning big homework of UCAS☆37Updated 6 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆17Updated 6 years ago
- adafactor optimizer for keras☆20Updated 3 years ago
- some strategies for exposure bias in seq2seq☆18Updated 4 years ago
- 蚂蚁金融自然语言处理竞赛。☆9Updated 6 years ago
- 以孤立语假设和宽度优先搜索为基础,构建了一种多通道堆叠注意力Transformer结构的斗地主ai☆93Updated 4 years ago
- The AlphaZero for the WTN-EinStein Chess☆6Updated 6 years ago
- 目前只有阅读理解赛道的☆14Updated 4 years ago
- 21th place (top2%) solution for kaggle TensorFlow 2.0 Question Answering☆17Updated 5 years ago
- ☆61Updated 6 years ago
- Codes for paper "LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification"☆16Updated 5 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆67Updated 8 years ago
- ☆23Updated 4 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆206Updated 2 months ago
- An Python N-in-Row game based on Monte Carlo Tree Search and UCT RAVE☆51Updated 7 years ago
- A tiny re-implementation of AlphaGo Zero (in Gomoku)☆75Updated 7 years ago
- R-Net with PyTorch☆24Updated 7 years ago
- 基于Transformer的单模型、多尺度的VAE模型☆55Updated 3 years ago
- Enhancing Sentence Embedding with Generalized Pooling☆11Updated 6 years ago
- 我对看过的以及用过的一些nlp方面的神经网络的结构介绍☆23Updated 7 years ago
- An implementation of "Two are Better than One: An Ensemble of Retrieval- and Generation-Based Dialog Systems"☆14Updated 5 years ago
- Code for "A Novel Aspect-Guided Deep Transition Model for Aspect Based Sentiment Analysis." on EMNLP 2019.☆21Updated 5 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆166Updated 6 years ago
- end-to-end dialog system dataset☆12Updated 5 years ago
- Task-oriented Dialog Policy Learning with Multi-Agent Reinforcement Learning☆56Updated 4 years ago
- Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning☆26Updated 6 years ago
- named entity recognition combined with rule from entity dict☆12Updated 4 years ago
- 非常好用的工具包,可以直接安装并使用☆20Updated 3 years ago
- The source code of the paper 'Dynamic Knowledge Routing Network For Target-Guided Open-Domain Conversation'☆24Updated 2 years ago
- 精简版NEZHA模型权重☆21Updated 4 years ago