yinmingjun / TinyBERTLinks
☆94Updated 6 years ago
Alternatives and similar repositories for TinyBERT
Users that are interested in TinyBERT are comparing it to the libraries listed below
Sorting:
- ☆167Updated 4 years ago
- Pretrain CPM-1☆52Updated 4 years ago
- pytorch implementation for Patient Knowledge Distillation for BERT Model Compression☆203Updated 6 years ago
- Finetune CPM-1☆75Updated 2 years ago
- Implementation of "Glancing Transformer for Non-Autoregressive Neural Machine Translation"☆137Updated 2 years ago
- RoFormer升级版☆154Updated 3 years ago
- ParaGen is a PyTorch deep learning framework for parallel sequence generation.☆185Updated 3 years ago
- FLASHQuad_pytorch☆68Updated 3 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 4 years ago
- ☆23Updated 5 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆124Updated 8 months ago
- pytorch版simcse无监督语义相似模型☆23Updated 4 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆98Updated 2 years ago
- ☆120Updated 4 years ago
- 科大讯飞低资源多语种文本翻译挑战赛获奖方案☆27Updated 2 years ago
- The code of paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" published at NeurIPS 202…☆48Updated 3 years ago
- ☆54Updated 3 years ago
- MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices☆71Updated 5 years ago
- a simple yet complete implementation of the popular BERT model☆128Updated 5 years ago
- Code for CPM-2 Pre-Train☆158Updated 2 years ago
- A Dataset for Multi-Turn Dialogue Reasoning☆332Updated 5 years ago
- modification of official bert for downstream task☆32Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Updated 2 years ago
- ROUGE for multilingual Summarization☆25Updated 4 years ago
- ☆50Updated 2 years ago
- 简洁易用版TinyBert:基于Bert进行知识蒸馏的预训练语言模型☆270Updated 5 years ago
- Chinese Transformer Generative Pre-Training Model☆59Updated 6 years ago
- Finetune CPM-2☆81Updated 2 years ago
- Codes for the paper "Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding" (ACL-IJCNLP 2021)☆41Updated 4 years ago
- A pre-trained model with multi-exit transformer architecture.☆56Updated 3 years ago