freefuiiismyname / ddz-ai
以孤立语假设和宽度优先搜索为基础,构建了一种多通道堆叠注意力Transformer结构的斗地主ai
☆93Updated 4 years ago
Alternatives and similar repositories for ddz-ai
Users that are interested in ddz-ai are comparing it to the libraries listed below
Sorting:
- adafactor optimizer for keras☆20Updated 3 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆58Updated 4 years ago
- ☆30Updated 5 years ago
- Natural Language Procesing☆34Updated 4 years ago
- Deep reinforcement learning agents implement by tensorflow https://ghli.org☆53Updated 5 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- Keras implement of Lazy optimizer☆21Updated 5 years ago
- saving memory by recomputing for keras☆37Updated 5 years ago
- Chinese Translation for Book 《Reinforcement Learning- An Introduction》-Second Edition☆124Updated 6 years ago
- 200行写一个自动微分工具☆51Updated 5 years ago
- This is implementation of the paper 'Toward Diverse Text Generation with Inverse Reinforcement Learning' https://arxiv.org/abs/1804.11258…☆33Updated 6 years ago
- bert4keras实现gpt下中国象棋☆44Updated 4 years ago
- (TG'2021) Code for paper "Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning". TG = Transact…☆10Updated 2 years ago
- Playing Wechat Jump Game with End-to-End Convolutional Neural Networks☆180Updated 7 years ago
- A simple middleware to improving GPU utilization then speedup online inference.☆19Updated 4 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 4 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- ☆31Updated 6 years ago
- Sequential Matching Network implemented by MXNET☆18Updated 6 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Updated 2 years ago
- LAMB Optimizer for Large Batch Training (TensorFlow version)☆120Updated 5 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆67Updated 8 years ago
- Finetune CPM-1☆74Updated 2 years ago
- Reinforcement Learning in Python☆107Updated 5 years ago
- Notes of my introduction about NLP in Fudan University☆37Updated 3 years ago
- Website for CBVRP Grand Challenge in ACM Multimedia 2019☆32Updated 5 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding 论文的中文翻译 Paper Chinese Translation!☆49Updated 5 years ago
- 香侬科技(北京香侬慧语科技有限责任公司)知乎爆料备份☆41Updated 4 years ago
- A dual learning toolkit developed by Microsoft Research☆71Updated last year
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆79Updated 6 years ago