bojone / adafactorLinks
adafactor optimizer for keras
☆20Updated 4 years ago
Alternatives and similar repositories for adafactor
Users that are interested in adafactor are comparing it to the libraries listed below
Sorting:
- 高校赛2019 文本点击预测☆43Updated 5 years ago
- Adversarial Training for NLP in Keras☆46Updated 5 years ago
- 2019中国高校计算机大赛——大数据挑战赛 第一名解决方案☆42Updated 5 years ago
- saving memory by recomputing for keras☆37Updated 5 years ago
- Keras implement of Lazy optimizer☆21Updated 5 years ago
- bert-of-theseus via bert4keras☆31Updated 5 years ago
- This is our solution for KDD Cup 2020. We implemented a very neat and simple neural ranking model based on siamese BERT which ranked firs…☆71Updated 5 years ago
- This is our solution for WSDM - DiggSci 2020. We implemented a simple yet robust search pipeline which ranked 2nd in the validation set a…☆63Updated 5 years ago
- tensorflow version of bert-of-theseus☆63Updated 4 years ago
- wrapping a keras optimizer to implement gradient accumulation☆119Updated 5 years ago
- 用bert4keras来解小学数学应用题☆77Updated 4 years ago
- RAdam optimizer for keras☆71Updated 5 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆60Updated 5 years ago
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆194Updated 3 years ago
- 中文 预训练 ELECTRA 模型: 基于对抗学习 pretrain Chinese Model☆141Updated 5 years ago
- 天池人工智能创新赛3-ch12hu团队周星星分享☆27Updated 4 years ago
- A pytorch implementation of Attention is all you need☆91Updated 6 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding 论文的中文翻译 Paper Chinese Translation!☆49Updated 6 years ago
- Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力…☆130Updated 4 years ago
- 基于capsule的观点型阅读理解模型☆89Updated 6 years ago
- CCF BDCI 2019 “技术需求”与“技术成果”项目之间关联度计算模型 复赛B榜top1解决方案☆77Updated 2 years ago
- a beautiful method for cluster or community detection☆51Updated 5 years ago
- Natural Language Procesing☆35Updated 4 years ago
- KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall first place☆192Updated 5 years ago
- bert4keras实现gpt下中国象棋☆46Updated 4 years ago
- top8 KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall☆37Updated 2 years ago
- 人人都能看懂的轻量级解决方案☆16Updated 5 years ago
- Kaggle新赛(baseline)-基于BERT的fine-tuning方案+基于tensor2tensor的Transformer Encoder方案☆61Updated 6 years ago
- ☆22Updated 7 years ago
- ESIM model with lanuage model☆27Updated 6 years ago