guotong1988 / BERT-pre-training

Multi-GPU pre-training for BERT on a single machine without Horovod (data parallelism)
★ 172 · Updated last month
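The description names single-machine data parallelism without Horovod: each GPU processes a shard of the batch, local gradients are averaged, and every replica applies the same update. The following is a minimal pure-Python sketch of that idea using a hypothetical 1-D linear model; it is an illustration of the technique, not the repository's actual code.

```python
# Sketch of single-machine data parallelism (no Horovod):
# each "device" gets a shard of the batch, computes local gradients,
# and the averaged gradient drives one shared weight update.
# Model y = w * x is a hypothetical stand-in for BERT.

def grad(w, xs, ys):
    # Gradient of mean squared error 0.5 * (w*x - y)^2 over one shard.
    return sum((w * x - y) * x for x, y in zip(xs, ys)) / len(xs)

def data_parallel_step(w, xs, ys, n_devices, lr=0.1):
    shard = len(xs) // n_devices
    # Each device runs forward/backward on its own shard.
    grads = [grad(w, xs[i * shard:(i + 1) * shard],
                     ys[i * shard:(i + 1) * shard])
             for i in range(n_devices)]
    g = sum(grads) / n_devices      # all-reduce (average) across devices
    return w - lr * g               # identical update applied everywhere

xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]           # data generated with true w = 2
w = 0.0
for _ in range(50):
    w = data_parallel_step(w, xs, ys, n_devices=2)
```

With equal-size shards, the averaged shard gradients equal the full-batch gradient, so the parallel update matches what a single device would compute on the whole batch.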
