yitu-opensource / ConvBertLinks

☆254

Alternatives and similar repositories for ConvBert

Users that are interested in ConvBert are comparing it to the libraries listed below

Sorting:

JetRunner / BERT-of-Theseus
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
☆315Updated 2 years ago
lonePatient / electra_pytorch
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆91Updated 4 years ago
microsoft / MPNet
MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf
☆292Updated 4 years ago
guolinke / TUPE
Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…
☆252Updated 3 years ago
asappresearch / revisit-bert-finetuning
For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).
☆184Updated 2 years ago
graykode / ALBERT-Pytorch
Pytorch Implementation of ALBERT(A Lite BERT for Self-supervised Learning of Language Representations)
☆227Updated 4 years ago
fastnlp / TENER
Codes for "TENER: Adapting Transformer Encoder for Named Entity Recognition"
☆378Updated 5 years ago
autoliuweijie / FastBERT
The score code of FastBERT (ACL2020)
☆609Updated 3 years ago
ShannonAI / dice_loss_for_NLP
The repo contains the code of the ACL2020 paper `Dice Loss for Data-imbalanced NLP Tasks`
☆274Updated 2 years ago
yaoxingcheng / TLM
ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework
☆255Updated last year
Sanyuan-Chen / RecAdam
Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.
☆118Updated 4 years ago
linzehui / mRASP
☆168Updated 3 years ago
joongbo / tta
Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"
☆110Updated 4 years ago
llamazing / numnet_plus
This is the official code repository for NumNet+(https://leaderboard.allenai.org/drop/submission/blu418v76glsbnh1qvd0)
☆176Updated last year
TsinghuaAI / CPM-1-Finetune
Finetune CPM-1
☆75Updated 2 years ago
bohanli / BERT-flow
TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)
☆534Updated 4 years ago
laiguokun / Funnel-Transformer
☆219Updated 5 years ago
microsoft / fastseq
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…
☆432Updated 3 years ago
alexa / bort
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
☆471Updated 3 years ago
MC-BERT / MC-BERT
☆97Updated 5 years ago
yzh119 / BPT
Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"
☆128Updated 4 years ago
rikeda71 / TorchCRF
An Inplementation of CRF (Conditional Random Fields) in PyTorch 1.0
☆137Updated 5 years ago
mawentao277 / CharBERT
CharBERT: Character-aware Pre-trained Language Model (COLING2020)
☆121Updated 4 years ago
lipiji / Guyu
Chinese GPT2: pre-training and fine-tuning framework for text generation
☆187Updated 4 years ago
bytedance / ParaGen
ParaGen is a PyTorch deep learning framework for parallel sequence generation.
☆185Updated 2 years ago
ymcui / LAMB_Optimizer_TF
LAMB Optimizer for Large Batch Training (TensorFlow version)
☆121Updated 5 years ago
pmichel31415 / are-16-heads-really-better-than-1
Code for the paper "Are Sixteen Heads Really Better than One?"
☆172Updated 5 years ago
microsoft / MT-DNN
Multi-Task Deep Neural Networks for Natural Language Understanding
☆163Updated 2 years ago
microsoft / Unicoder
Unicoder model for understanding and generation.
☆91Updated last year
zhuchen03 / FreeLB
Adversarial Training for Natural Language Understanding
☆253Updated 2 years ago