guotong1988 / BERT-GPU
Multi-GPU pre-training of BERT from scratch on a single machine, without Horovod (data parallelism)
☆173Updated last month
Related projects
Alternatives and complementary repositories for BERT-GPU
- Feel free to fine tune large BERT models with Multi-GPU and FP16 support.☆192Updated 4 years ago
- ☆214Updated 5 years ago
- export bert model for serving☆142Updated 5 years ago
- ☆279Updated 3 years ago
- DistilBERT for Chinese: a distilled BERT model pre-trained on a massive Chinese corpus☆90Updated 4 years ago
- A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)☆126Updated 2 years ago
- BERT-based Chinese sequence labeling☆142Updated 6 years ago
- Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".☆340Updated 5 years ago
- Rank2 solution (no-BERT) for 2019 Language and Intelligence Challenge - DuReader2.0 Machine Reading Comprehension.☆128Updated 5 years ago
- An Implementation of 'Attention is all you need' with Chinese Corpus☆130Updated 6 months ago
- TensorFlow implementation of the ESIM model (Enhanced LSTM for natural language inference)☆77Updated 5 years ago
- transform multi-label classification as sentence pair task, with more training data and information☆178Updated 4 years ago
- TensorFlow code and pre-trained models for BERT☆114Updated 4 years ago
- ☆89Updated 4 years ago
- BERT as language model, fork from https://github.com/google-research/bert☆247Updated 8 months ago
- Use ELMo in a Chinese environment☆104Updated 6 years ago
- Deep contextualized word representations for Chinese☆152Updated 5 years ago
- baseline system of knowledge driven dialogue competition☆270Updated 5 years ago
- Dataset for CIKM 2018 paper "Multi-Source Pointer Network for Product Title Summarization"☆73Updated 6 years ago
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3☆183Updated 4 years ago
- BertQA - Attention on Steroids☆115Updated 2 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆65Updated 4 years ago
- Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning☆51Updated 5 years ago
- Collections of Chinese reading comprehension datasets☆214Updated 4 years ago
- Pre-trained Chinese XLNet model: Pre-Trained Chinese XLNet_Large☆230Updated 5 years ago
- Re-implementation of BIMPM (Bilateral Multi-Perspective Matching for Natural Language Sentences, Zhiguo Wang et al.) on Pytorch.☆106Updated 5 years ago
- question answering, reading comprehension toolkit☆167Updated 2 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆145Updated 5 years ago
- Slot-Gated Modeling for Joint Slot Filling and Intent Prediction☆305Updated 3 years ago
- Paper implementation (ACL 2019): "Matching the Blanks: Distributional Similarity for Relation Learning"☆155Updated last year