yinmingjun / TinyBERT
☆93Updated 6 years ago
Alternatives and similar repositories for TinyBERT
Users who are interested in TinyBERT are comparing it to the repositories listed below.
- ☆167Updated 4 years ago
- Implementation of "Glancing Transformer for Non-Autoregressive Neural Machine Translation"☆137Updated 2 years ago
- Pretrain CPM-1☆52Updated 4 years ago
- Finetune CPM-1☆75Updated 2 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆123Updated 7 months ago
- A pre-trained model with multi-exit transformer architecture.☆56Updated 3 years ago
- ☆54Updated 3 years ago
- Transformer model based on the Gated Attention Unit (preview version)☆98Updated 2 years ago
- ParaGen is a PyTorch deep learning framework for parallel sequence generation.☆185Updated 3 years ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆198Updated 2 years ago
- A *tuned* minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆119Updated 4 years ago
- ☆120Updated 4 years ago
- Introduction to CPM☆165Updated 4 years ago
- Code for CPM-2 Pre-Train☆158Updated 2 years ago
- A Dataset for Multi-Turn Dialogue Reasoning☆333Updated 5 years ago
- ☆50Updated 2 years ago
- 💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.☆181Updated 3 years ago
- PyTorch implementation of Patient Knowledge Distillation for BERT Model Compression☆204Updated 6 years ago
- The code of paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" published at NeurIPS 202…☆48Updated 3 years ago
- Large-scale Chinese corpus☆44Updated 6 years ago
- ☆254Updated 3 years ago
- Upgraded version of RoFormer☆154Updated 3 years ago
- Finetune CPM-2☆81Updated 2 years ago
- FLASHQuad_pytorch☆68Updated 3 years ago
- A PyTorch-based model pruning toolkit for pre-trained language models☆388Updated 2 years ago
- Official repository of the AAAI'2022 paper "GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning…☆108Updated 3 years ago
- ChID: A Large-scale Chinese IDiom Dataset for Cloze Test☆151Updated 2 years ago
- ☆82Updated 2 years ago
- A simple, easy-to-use TinyBert: a pre-trained language model built by knowledge distillation from Bert☆268Updated 5 years ago
- Open source code for EMNLP 2020 Findings Paper "AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent Detection and Slo…☆87Updated 4 years ago