bojone / adafactor
adafactor optimizer for keras
☆20Updated 3 years ago
Alternatives and similar repositories for adafactor:
Users that are interested in adafactor are comparing it to the libraries listed below
- AI Challenger 2018 阅读理解赛道代码分享☆21Updated 6 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- Keras implement of Lazy optimizer☆21Updated 5 years ago
- CLUE Emotion Analysis Dataset 细粒度情感分析数据集☆8Updated 5 years ago
- Python下shuffle几百G文件☆33Updated 3 years ago
- Adversarial Training for NLP in Keras☆46Updated 5 years ago
- Pytorch implementation of Neural Machine Translation with seq2seq and attention (en-zh)☆41Updated 6 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Updated 2 years ago
- Official code of our work, Robust, Transferable Sentence Representations for Text Classification [Arxiv 2018].☆21Updated 6 years ago
- RAdam optimizer for keras☆71Updated 5 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆58Updated 4 years ago
- 无监督文本生成的一些方法☆48Updated 3 years ago
- pytorch版bert权重转tf☆21Updated 4 years ago
- 精简版NEZHA模型权重☆21Updated 4 years ago
- tensorflow version of bert-of-theseus☆62Updated 4 years ago
- 2019达观杯实体识别☆19Updated 5 years ago
- 天池-新冠疫情相似句对判定大赛 大白_Rank6☆21Updated 5 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding 论文的中文翻译 Paper Chinese Translation!☆49Updated 5 years ago
- This is our solution for WSDM - DiggSci 2020. We implemented a simple yet robust search pipeline which ranked 2nd in the validation set a…☆63Updated 4 years ago
- saving memory by recomputing for keras☆37Updated 4 years ago
- 目前只有阅读理解赛道的☆14Updated 4 years ago
- ☆22Updated 6 years ago
- Dilation Gate CNN For Machine Reading Comprehension☆17Updated 2 years ago
- Knowledge Graph based Question Answering benchmark.☆10Updated 5 years ago
- machine reading comprehension with deep learning☆20Updated 7 years ago
- 2019中国高校计算机大赛——大数据挑战赛 第一名解决方案☆41Updated 4 years ago
- lightweighted deep learning inference service framework☆39Updated 3 years ago
- reformer-pytorch中文版本,简单高效的生成模型。类似GPT2的效果☆16Updated last year
- TripleNet: Triple Attention Network for Multi-Turn Response Selection in Retrieval-based Chatbots (CoNLL2019)☆25Updated 5 years ago
- 高校赛2019 文本点击预测☆42Updated 5 years ago