haoyuhu / bert-multi-gpu
Feel free to fine tune large BERT models with Multi-GPU and FP16 support.
☆192Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for bert-multi-gpu
- TensorFlow code and pre-trained models for BERT☆114Updated 4 years ago
- multi-gpu pre-training in one machine for BERT from scratch without horovod (Data Parallelism)☆173Updated last month
- BERT as language model, fork from https://github.com/google-research/bert☆247Updated 8 months ago
- Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".☆340Updated 5 years ago
- ☆215Updated 5 years ago
- export bert model for serving☆142Updated 5 years ago
- ALBERT model Pretraining and Fine Tuning using TF2.0☆200Updated last year
- ☆278Updated 3 years ago
- 中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large☆230Updated 5 years ago
- ☆444Updated 2 years ago
- Collections of Chinese reading comprehension datasets☆214Updated 4 years ago
- Implementation of XLNet that can load pretrained checkpoints☆172Updated 2 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆145Updated 5 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆90Updated 4 years ago
- transform multi-label classification as sentence pair task, with more training data and information☆178Updated 4 years ago
- 基于BERT的中文序列标注☆142Updated 6 years ago
- TensorFlow implementation of the ESIM model (Enhanced LTSM for natural language inference)☆77Updated 5 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆309Updated 4 years ago
- 论文实现(ACL2019):《Matching the Blanks: Distributional Similarity for Relation Learning》☆154Updated last year
- A PyTorch implementation of Mnemonic Reader for the Machine Comprehension task☆136Updated 6 years ago
- Implementation of the ESIM model for natural language inference with PyTorch☆366Updated 3 years ago
- 将百度ernie的paddlepaddle模型转成tensorflow模型☆177Updated 5 years ago
- Slot-Gated Modeling for Joint Slot Filling and Intent Prediction☆305Updated 3 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆120Updated last year
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆191Updated 2 years ago
- BertQA - Attention on Steroids☆115Updated 2 years ago
- Deep contextualized word representations for Chinese☆152Updated 5 years ago
- Data Augmentation for NLP. NLP数据增强☆294Updated 3 years ago
- question answering, reading comprehension toolkit☆167Updated 2 years ago