TensorFlow code and pre-trained models for BERT
☆117Mar 11, 2020Updated 6 years ago
Alternatives and similar repositories for bert
Users that are interested in bert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Feel free to fine tune large BERT models with Multi-GPU and FP16 support.☆192Mar 9, 2020Updated 6 years ago
- TensorFlow code and pre-trained models for BERT☆24Apr 19, 2019Updated 7 years ago
- multi-gpu pre-training in one machine for BERT without horovod (Data Parallelism)☆171Dec 27, 2025Updated 4 months ago
- chinese wwm masking and ngram masking based on jieba☆11Jul 25, 2019Updated 6 years ago
- 21th place (top2%) solution for kaggle TensorFlow 2.0 Question Answering☆16Feb 7, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆28May 31, 2018Updated 7 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆230Dec 4, 2020Updated 5 years ago
- TensorFlow code and pre-trained models for BERT☆11May 2, 2019Updated 7 years ago
- BERT for Multitask Learning☆544Apr 12, 2023Updated 3 years ago
- Transfomer based implementation of "An Efficient Framework For Learning Sentence Representation" Logeswaran et al☆13Aug 5, 2018Updated 7 years ago
- Code for the ACL 2022 (Long paper): "New Intent Discovery with Pre-training and Contrastive Learning".☆14Jul 18, 2022Updated 3 years ago
- BERT distillation(基于BERT的蒸馏实验 )☆317Jul 30, 2020Updated 5 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,175May 28, 2023Updated 2 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,690Dec 1, 2025Updated 5 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- code for EMNLP2018 paper 'Associative-multichannel-autoencoder for multimodal word representation'☆13Aug 24, 2018Updated 7 years ago
- Code to reproduce the paper Working Memory Networks☆26Jun 28, 2018Updated 7 years ago
- ☆15Jul 17, 2020Updated 5 years ago
- ☆21Nov 29, 2022Updated 3 years ago
- ☆18Oct 16, 2020Updated 5 years ago
- Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).☆1,278May 19, 2022Updated 4 years ago
- ☆16Aug 6, 2018Updated 7 years ago
- bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目☆1,844Mar 21, 2021Updated 5 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,369Mar 23, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The experiment result of LSTM language models on PTB (Penn Treebank) and GBW (Google Billion Word) using AdaptiveSoftmax on TensorFlow.☆99Oct 17, 2018Updated 7 years ago
- Ongoing research training transformer language models at scale, including: BERT☆16Apr 25, 2019Updated 7 years ago
- ICLR 2018 Quick-Thought vectors☆204Jul 15, 2019Updated 6 years ago
- export bert model for serving☆141Dec 12, 2018Updated 7 years ago
- Implementation of the ACL Findings paper "OutFlip: Generating Examples for Unknown Intent Detection with Natural Language Attack"☆10May 24, 2021Updated 5 years ago
- Dynamic resources changes for multi-dimensional parallelism training☆31Aug 22, 2025Updated 9 months ago
- ☆22Jun 5, 2019Updated 6 years ago
- A TensorFlow Implementation of the Transformer: Attention Is All You Need☆4,473May 21, 2023Updated 3 years ago
- ALBERT model Pretraining and Fine Tuning using TF2.0☆204Mar 24, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,162Jan 22, 2024Updated 2 years ago
- SlotRefine: A Fast Non-Autoregressive Model forJoint Intent Detection and Slot Filling☆48Apr 27, 2021Updated 5 years ago
- 基于方差权重因子选词的SIF句向量模型-实验源码☆11Mar 8, 2020Updated 6 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,700May 8, 2023Updated 3 years ago
- NLU: domain-intent-slot; text2SQL☆74Apr 18, 2020Updated 6 years ago
- Implementation of the Paper "Entity Linking in Web Tables with Multiple Linked Knowledge Bases"☆10Oct 27, 2017Updated 8 years ago
- code examples of experimenting with the Rvision package☆12Feb 13, 2018Updated 8 years ago