freefuiiismyname / ddz-ai
以孤立语假设和宽度优先搜索为基础,构建了一种多通道堆叠注意力Transformer结构的斗地主ai
☆92Updated 3 years ago
Alternatives and similar repositories for ddz-ai:
Users that are interested in ddz-ai are comparing it to the libraries listed below
- bert4keras实现gpt下 中国象棋☆43Updated 4 years ago
- ☆30Updated 5 years ago
- adafactor optimizer for keras☆20Updated 3 years ago
- Natural Language Procesing☆34Updated 3 years ago
- Keras implement of Lazy optimizer☆21Updated 5 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆57Updated 4 years ago
- Notes of my introduction about NLP in Fudan University☆37Updated 3 years ago
- 200行写一个自动微分工具☆50Updated 5 years ago
- ☆31Updated 5 years ago
- 用强化学习来玩微信跳一跳☆18Updated 7 years ago
- This is implementation of the paper 'Toward Diverse Text Generation with Inverse Reinforcement Learning' https://arxiv.org/abs/1804.11258…☆33Updated 6 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding 论文的中文翻译 Paper Chinese Translation!☆49Updated 5 years ago
- 深度学习和NLP随笔☆26Updated 5 years ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.☆56Updated 2 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- (Beta Version!) Experiment Code for Paper ``CoT: Cooperative Training for Generative Modeling of Discrete Data''☆73Updated 5 years ago
- 基于Transformer的单模型、多尺度的VAE模型☆55Updated 3 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Updated 5 years ago
- Chinese Translation for Book 《Reinforcement Learning- An Introduction》-Second Edition☆122Updated 5 years ago
- 用bert4keras来解小学数学应用题☆77Updated 4 years ago
- 高质量闲聊数据介绍☆29Updated 6 years ago
- saving memory by recomputing for keras☆37Updated 4 years ago
- (TG'2021) Code for paper "Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning". TG = Transact…☆11Updated last year
- Finetune CPM-1☆75Updated last year
- CLUE Emotion Analysis Dataset 细粒度情感分析数据集☆8Updated 5 years ago
- A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.☆69Updated 4 years ago
- ☆3Updated 2 months ago
- Crawler used to crawl papers☆25Updated 6 years ago
- A.M.C. - Artificial Intelligence & Machine Learning CLUB: Friends for Code, Paper, and Beers!🍻☆120Updated last year