haoyuhu / bert-multi-gpu
Feel free to fine-tune large BERT models with multi-GPU and FP16 support.
☆192 Updated 5 years ago
Alternatives and similar repositories for bert-multi-gpu
Users who are interested in bert-multi-gpu are comparing it to the libraries listed below.
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism) ☆172 Updated 2 months ago
- TensorFlow code and pre-trained models for BERT ☆114 Updated 5 years ago
- BERT as language model, fork from https://github.com/google-research/bert ☆247 Updated last year
- Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features". ☆337 Updated 5 years ago
- ☆278 Updated 4 years ago
- export bert model for serving ☆141 Updated 6 years ago
- ☆218 Updated 5 years ago
- transform multi-label classification as sentence pair task, with more training data and information ☆178 Updated 5 years ago
- XLNet Extension in TensorFlow ☆130 Updated 4 years ago
- Collections of Chinese reading comprehension datasets ☆217 Updated 5 years ago
- ALBERT model Pretraining and Fine Tuning using TF2.0 ☆202 Updated 2 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE ☆145 Updated 6 years ago
- BERT distillation (distillation experiments based on BERT) ☆313 Updated 4 years ago
- Neural word segmentation with rich pretraining, code for ACL 2017 paper ☆164 Updated 6 years ago
- Convert Baidu's ERNIE PaddlePaddle model into a TensorFlow model ☆177 Updated 5 years ago
- question answering, reading comprehension toolkit ☆166 Updated 2 years ago
- UNF (Universal NLP Framework) ☆70 Updated 5 years ago
- Pre-trained Chinese XLNet model: Pre-Trained Chinese XLNet_Large ☆230 Updated 5 years ago
- Re-implementation of BIMPM (Bilateral Multi-Perspective Matching for Natural Language Sentences, Zhiguo Wang et al.) in PyTorch. ☆103 Updated 5 years ago
- NLU: domain-intent-slot; text2SQL ☆74 Updated 5 years ago
- Paper implementation (ACL 2019): "Matching the Blanks: Distributional Similarity for Relation Learning" ☆154 Updated 2 years ago
- Leaderboards, Datasets and Papers for Multi-Turn Response Selection in Retrieval-Based Chatbots ☆203 Updated 4 years ago
- TensorFlow implementation of the ESIM model (Enhanced LSTM for natural language inference) ☆76 Updated 6 years ago
- Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT) ☆66 Updated 6 years ago
- BERT-based Chinese sequence labeling ☆141 Updated 6 years ago
- DistilBERT for Chinese: a distilled BERT model pre-trained on large-scale Chinese data ☆92 Updated 5 years ago
- Data Augmentation for NLP ☆295 Updated 4 years ago
- Slot-Gated Modeling for Joint Slot Filling and Intent Prediction ☆304 Updated 4 years ago
- Pre-trained Chinese ELECTRA model: a Chinese model pre-trained with an adversarial-learning-based approach ☆140 Updated 5 years ago
- Dataset for CIKM 2018 paper "Multi-Source Pointer Network for Product Title Summarization" ☆73 Updated 6 years ago