freefuiiismyname / ddz-ai
以孤立语假设和宽度优先搜索为基础,构建了一种多通道堆叠注意力Transformer结构的斗地主ai
☆92Updated 3 years ago
Alternatives and similar repositories for ddz-ai:
Users that are interested in ddz-ai are comparing it to the libraries listed below
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- bert4keras实现gpt下中国象棋☆43Updated 4 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆57Updated 4 years ago
- (TG'2021) Code for paper "Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning". TG = Transact…☆11Updated last year
- Keras implement of Lazy optimizer☆21Updated 5 years ago
- adafactor optimizer for keras☆20Updated 3 years ago
- 香侬科技(北京香侬慧语科技有限责任公司)知乎爆料备份☆41Updated 4 years ago
- ☆30Updated 4 years ago
- Finetune CPM-1☆75Updated last year
- Notes of my introduction about NLP in Fudan University☆37Updated 3 years ago
- Deep reinforcement learning agents implement by tensorflow https://ghli.org☆53Updated 5 years ago
- This is implementation of the paper 'Toward Diverse Text Generation with Inverse Reinforcement Learning' https://arxiv.org/abs/1804.11258…☆34Updated 6 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Updated 5 years ago
- A dual learning toolkit developed by Microsoft Research☆70Updated last year
- Chinese Translation for Book 《Reinforcement Learning- An Introduction》-Second Edition☆123Updated 5 years ago
- 中文生成式预训练模型☆98Updated 4 years ago
- A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.☆68Updated 4 years ago
- Natural Language Procesing☆34Updated 3 years ago
- 200行写一个自动微分工具☆50Updated 5 years ago
- ☆3Updated last month
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- (Beta Version!) Experiment Code for Paper ``CoT: Cooperative Training for Generative Modeling of Discrete Data''☆73Updated 5 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆103Updated 5 years ago
- 书籍《现代自然语言生成》介绍☆216Updated 4 years ago
- Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.☆31Updated 4 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- ☆18Updated 5 years ago
- A simple middleware to improving GPU utilization then speedup online inference.☆19Updated 3 years ago