bojone / adafactor
adafactor optimizer for keras
☆20Updated 3 years ago
Alternatives and similar repositories for adafactor
Users that are interested in adafactor are comparing it to the libraries listed below
Sorting:
- Keras implement of Lazy optimizer☆21Updated 5 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆58Updated 4 years ago
- saving memory by recomputing for keras☆37Updated 5 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Updated 2 years ago
- AI Challenger 2018 阅读理解赛道代码分享☆21Updated 6 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- pytorch版bert权重转tf☆21Updated 4 years ago
- 2019达观杯实体识别☆19Updated 5 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 4 years ago
- 目前只有阅读理解赛道的☆14Updated 4 years ago
- Adversarial Training for NLP in Keras☆46Updated 5 years ago
- implementation for comparison of supervised Learning to Match methods for product search☆37Updated 4 years ago
- 高校赛2019 文本点击预测☆42Updated 5 years ago
- CLUE Emotion Analysis Dataset 细粒度情感分析数据集☆8Updated 5 years ago
- ☆23Updated 4 years ago
- 2019中国高校计算机大赛——大数据挑战赛 第一名解决方案☆42Updated 4 years ago
- pytorch学习笔记☆8Updated 6 years ago
- Python下shuffle几百G文件☆33Updated 3 years ago
- pytorch版simcse无监督语义相似模型☆22Updated 4 years ago
- bert4keras实现gpt下中国象棋☆44Updated 4 years ago
- 无监督文本生成的一些方法☆48Updated 3 years ago
- 24*2个预训练的小型BERT模型,NLPer炼丹利器☆50Updated 5 years ago
- Implemented transformer NN block for Machine translation, text classfication, Natural language inference as well as Machine reading compr…☆11Updated last year
- TripleNet: Triple Attention Network for Multi-Turn Response Selection in Retrieval-based Chatbots (CoNLL2019)☆25Updated 5 years ago
- 高质量闲聊数据介绍☆29Updated 6 years ago
- tensorflow version of bert-of-theseus☆62Updated 4 years ago
- ☆59Updated 5 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding 论文的中文翻译 Paper Chinese Translation!☆49Updated 5 years ago
- 2021搜狐校园文本匹配算法大赛baseline☆45Updated 4 years ago
- Dilation Gate CNN For Machine Reading Comprehension☆17Updated 2 years ago