fastnlp / ElasticBERT
A pre-trained model with multi-exit transformer architecture.
☆56 Updated 3 years ago
Alternatives and similar repositories for ElasticBERT
Users who are interested in ElasticBERT compare it with the repositories listed below.
- Paradigm shift in natural language processing ☆42 Updated 3 years ago
- ☆99 Updated 3 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models ☆38 Updated 3 years ago
- ☆54 Updated 3 years ago
- Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021) ☆74 Updated 3 years ago
- The code of paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" published at NeurIPS 202… ☆48 Updated 3 years ago
- ACL 2021: HiTransformer ☆13 Updated 4 years ago
- ☆116 Updated 3 years ago
- reStructured Pre-training ☆99 Updated 3 years ago
- The unified platform for data-related resources. ☆135 Updated 2 years ago
- ☆80 Updated 3 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie… ☆26 Updated 3 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021. ☆57 Updated 3 years ago
- Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding" ☆35 Updated last year
- ☆67 Updated 4 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples ☆76 Updated 3 years ago
- Codes for the paper "Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding" (ACL-IJCNLP 2021) ☆41 Updated 4 years ago
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit". ☆66 Updated 4 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning ☆94 Updated 3 years ago
- This project maintains a reading list for general text generation tasks ☆66 Updated 4 years ago
- Must-read papers on improving efficiency for pre-trained language models. ☆105 Updated 3 years ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models" ☆29 Updated 2 years ago
- ☆45 Updated 4 years ago
- ☆69 Updated 3 years ago
- ROUGE for multilingual Summarization ☆25 Updated 4 years ago
- This repository contains the code for "How many data points is a prompt worth?" ☆48 Updated 4 years ago
- On Transferability of Prompt Tuning for Natural Language Processing ☆100 Updated last year
- EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering ☆68 Updated 4 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie… ☆49 Updated 3 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆121 Updated 2 years ago