luozhouyang / transformers-kerasLinks
Transformer-based models implemented in tensorflow 2.x(using keras).
☆75Updated 3 years ago
Alternatives and similar repositories for transformers-keras
Users that are interested in transformers-keras are comparing it to the libraries listed below
Sorting:
- Implementation of XLNet that can load pretrained checkpoints☆171Updated 3 years ago
- 将百度ernie的paddlepaddle模型转成tensorflow模型☆177Updated 5 years ago
- transform multi-label classification as sentence pair task, with more training data and information☆178Updated 5 years ago
- Named Entity Recognition (NER) task using Bi-LSTM-CRF model implemented in Tensorflow 2.0(tensorflow2.0 +)☆119Updated 5 years ago
- A Lite BERT☆59Updated 5 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆92Updated 5 years ago
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆193Updated 3 years ago
- Keras solution of Chinese NER task using BiLSTM-CRF/BiGRU-CRF/IDCNN-CRF model with Pretrained Language Model: supporting BERT/RoBERTa/ALB…☆12Updated 2 years ago
- ☆91Updated 5 years ago
- 转换 https://github.com/brightmart/albert_zh 到google格式☆62Updated 4 years ago
- datagrand 2019 information extraction competition rank9☆130Updated 5 years ago
- 中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large☆230Updated 5 years ago
- ☆279Updated 4 years ago
- bert/albert/roberta特征抽取服务端,基于bert-as-service,新增albert模型。☆1Updated 2 years ago
- ☆89Updated 5 years ago
- 整理一下在keras中使用T5模型的要点☆172Updated 3 years ago
- 中文 预训练 ELECTRA 模型: 基于对抗学习 pretrain Chinese Model☆140Updated 5 years ago
- Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".☆338Updated 5 years ago
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)☆172Updated 4 months ago
- keras implement of dgcnn for reading comprehension☆164Updated 5 years ago
- 基于tensorflow1.x的预训练模型调用,支持单机多卡、梯度累积,XLA加速,混合精度。可灵活训练、验证、预测。☆58Updated 3 years ago
- Bert-classification and bert-dssm implementation with keras.☆93Updated 5 years ago
- 基于BERT的中文序列标注☆141Updated 6 years ago
- tensorflow version of bert-of-theseus☆62Updated 4 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆313Updated 5 years ago
- Hierarchically-Refined Label Attention Network for Sequence Labeling☆292Updated 4 years ago
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3☆183Updated 5 years ago
- use ELMo in chinese environment☆104Updated 6 years ago
- Byte Cup 2018 International Machine Learning Contest (3rd prize)☆77Updated 2 years ago
- 达观算法比赛ner任务,从重新训练bert,到finetune预测。☆74Updated 2 years ago