TensorFlow code and pre-trained models for BERT
☆116Mar 11, 2020Updated 5 years ago
Alternatives and similar repositories for bert
Users that are interested in bert are comparing it to the libraries listed below
Sorting:
- Feel free to fine tune large BERT models with Multi-GPU and FP16 support.☆192Mar 9, 2020Updated 5 years ago
- TensorFlow code and pre-trained models for BERT☆24Apr 19, 2019Updated 6 years ago
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)☆171Dec 27, 2025Updated 2 months ago
- ☆28May 31, 2018Updated 7 years ago
- 21th place (top2%) solution for kaggle TensorFlow 2.0 Question Answering☆16Feb 7, 2020Updated 6 years ago
- BERT for Multitask Learning☆544Apr 12, 2023Updated 2 years ago
- Implementation of the ACL Findings paper "OutFlip: Generating Examples for Unknown Intent Detection with Natural Language Attack"☆10May 24, 2021Updated 4 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆314Jul 30, 2020Updated 5 years ago
- 基于方差权重因子选词的SIF句向量模型-实验源码☆11Mar 8, 2020Updated 5 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆230Dec 4, 2020Updated 5 years ago
- code for EMNLP2018 paper 'Associative-multichannel-autoencoder for multimodal word representation'☆13Aug 24, 2018Updated 7 years ago
- TensorFlow code and pre-trained models for BERT☆11May 2, 2019Updated 6 years ago
- ICLR 2018 Quick-Thought vectors☆204Jul 15, 2019Updated 6 years ago
- Code to reproduce the paper Working Memory Networks☆26Jun 28, 2018Updated 7 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,176May 28, 2023Updated 2 years ago
- Thin wrapper for the AllenNLP's implementation of supervised open information extraction☆17Nov 19, 2019Updated 6 years ago
- Transfomer based implementation of "An Efficient Framework For Learning Sentence Representation" Logeswaran et al☆13Aug 5, 2018Updated 7 years ago
- ML Reproducibility Challenge 2020: Electra reimplementation using PyTorch and Transformers☆12Apr 16, 2021Updated 4 years ago
- Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).☆1,273May 19, 2022Updated 3 years ago
- export bert model for serving☆141Dec 12, 2018Updated 7 years ago
- lattice lstm cell implementation with tensorflow☆30Aug 3, 2018Updated 7 years ago
- ☆15Jul 17, 2020Updated 5 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,371Mar 23, 2024Updated last year
- 高质量闲聊数据介绍☆30Dec 12, 2018Updated 7 years ago
- bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目☆1,848Mar 21, 2021Updated 4 years ago
- ALBERT model Pretraining and Fine Tuning using TF2.0☆204Mar 24, 2023Updated 2 years ago
- The experiment result of LSTM language models on PTB (Penn Treebank) and GBW (Google Billion Word) using AdaptiveSoftmax on TensorFlow.☆99Oct 17, 2018Updated 7 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,696May 8, 2023Updated 2 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,156Jan 22, 2024Updated 2 years ago
- A TensorFlow Implementation of the Transformer: Attention Is All You Need☆4,455May 21, 2023Updated 2 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,675Dec 1, 2025Updated 3 months ago
- Partial Codes and datasets for NeurIPS'19 "Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers"☆20Nov 1, 2019Updated 6 years ago
- ☆16Aug 6, 2018Updated 7 years ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- Dilation Gate CNN For Machine Reading Comprehension☆17Mar 24, 2023Updated 2 years ago
- TensorFlow code and pre-trained models for BERT☆17Feb 28, 2019Updated 7 years ago
- NLU: domain-intent-slot; text2SQL☆74Apr 18, 2020Updated 5 years ago
- ☆18Oct 16, 2020Updated 5 years ago
- Frequently used machine learning algorithms implemented in C++☆17Oct 1, 2020Updated 5 years ago