haoyuhu / bert-multi-gpuLinks
Feel free to fine tune large BERT models with Multi-GPU and FP16 support.
☆192Updated 5 years ago
Alternatives and similar repositories for bert-multi-gpu
Users that are interested in bert-multi-gpu are comparing it to the libraries listed below
Sorting:
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)☆171Updated 8 months ago
- TensorFlow code and pre-trained models for BERT☆116Updated 5 years ago
- Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".☆340Updated 6 years ago
- export bert model for serving☆141Updated 7 years ago
- BERT as language model, fork from https://github.com/google-research/bert☆249Updated last year
- TensorFlow code and pre-trained models for BERT and ERNIE☆147Updated 6 years ago
- ☆443Updated 3 years ago
- ☆280Updated 5 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆315Updated 5 years ago
- 中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large☆231Updated 6 years ago
- Collections of Chinese reading comprehension datasets☆221Updated 5 years ago
- 将百度ernie的paddlepaddle模型转成tensorflow模型☆178Updated 6 years ago
- Slot-Gated Modeling for Joint Slot Filling and Intent Prediction☆306Updated 4 years ago
- Deep contextualized word representations for Chinese☆151Updated 6 years ago
- TensorFlow implementation of the ESIM model (Enhanced LTSM for natural language inference)☆77Updated 6 years ago
- question answering, reading comprehension toolkit☆166Updated 3 years ago
- Data Augmentation for NLP. NLP数据增强☆295Updated 5 years ago
- Dataset for CIKM 2018 paper "Multi-Source Pointer Network for Product Title Summarization"☆73Updated 7 years ago
- ☆124Updated 6 years ago
- Implementation of the ESIM model for natural language inference with PyTorch☆374Updated 4 years ago
- A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)☆127Updated 3 years ago
- ☆220Updated 6 years ago
- transform multi-label classification as sentence pair task, with more training data and information☆178Updated 6 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆95Updated 6 years ago
- BERT for Multitask Learning☆548Updated 2 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆66Updated 5 years ago
- baseline system of knowledge driven dialogue competition☆269Updated 6 years ago
- The code of ACL 2019 paper: Matching Article Pairs with Graphical Decomposition and Convolutions☆236Updated 5 years ago
- Neural word segmentation with rich pretraining, code for ACL 2017 paper☆164Updated 6 years ago
- Enhanced LTSM for natural language inference☆265Updated 5 years ago