lambdal / bertLinks
TensorFlow code and pre-trained models for BERT
☆116Updated 5 years ago
Alternatives and similar repositories for bert
Users that are interested in bert are comparing it to the libraries listed below
Sorting:
- Feel free to fine tune large BERT models with Multi-GPU and FP16 support.☆192Updated 5 years ago
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)☆171Updated 6 months ago
- TensorFlow code and pre-trained models for BERT☆17Updated 6 years ago
- ALBERT model Pretraining and Fine Tuning using TF2.0☆204Updated 2 years ago
- question answering, reading comprehension toolkit☆166Updated 2 years ago
- A PyTorch implementation of Mnemonic Reader for the Machine Comprehension task☆135Updated 6 years ago
- BERT as language model, fork from https://github.com/google-research/bert☆248Updated last year
- XLNet Extension in TensorFlow☆131Updated 4 years ago
- Multi-class metrics for Tensorflow☆223Updated 3 years ago
- ☆443Updated 3 years ago
- Re-implementation of BIMPM (Bilateral Multi-Perspective Matching for Natural Language Sentences, Zhiguo Wang et al.) on Pytorch.☆103Updated 5 years ago
- TensorFlow code and pre-trained models for BERT☆24Updated 6 years ago
- NLU: domain-intent-slot; text2SQL☆74Updated 5 years ago
- Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems☆352Updated 2 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121Updated 2 years ago
- Slot-Gated Modeling for Joint Slot Filling and Intent Prediction☆304Updated 4 years ago
- LAMB Optimizer for Large Batch Training (TensorFlow version)☆121Updated 5 years ago
- DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.☆323Updated 4 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆315Updated 2 years ago
- Which Encoding is the Best for Text Classification in Chinese, English, Japanese and Korean?☆173Updated 7 years ago
- ☆220Updated 5 years ago
- Enhanced LTSM for natural language inference☆265Updated 5 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆65Updated 5 years ago
- Dataset for CIKM 2018 paper "Multi-Source Pointer Network for Product Title Summarization"☆73Updated 7 years ago
- Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".☆339Updated 6 years ago
- CopyNet Implementation with Tensorflow and nmt☆122Updated 6 years ago
- R-net in PyTorch, with ELMo☆198Updated 5 years ago
- ☆28Updated 7 years ago
- Neural network toolkit for sentence pair modeling.☆304Updated 4 years ago
- Fully Statistical Neural Belief Tracker (Mrkšić and Vulić, ACL 2018)☆166Updated last year