google-research / adapter-bert
☆489 · Updated last year
Alternatives and similar repositories for adapter-bert:
Users who are interested in adapter-bert are comparing it to the libraries listed below:
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723 ☆729 · Updated 2 years ago
- Prefix-Tuning: Optimizing Continuous Prompts for Generation ☆919 · Updated 11 months ago
- ☆397 · Updated 3 years ago
- A novel method to tune language models. Code and datasets for the paper "GPT Understands, Too". ☆930 · Updated 2 years ago
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022) ☆526 · Updated 3 years ago
- ☆876 · Updated 11 months ago
- ☆291 · Updated 2 years ago
- Code for using and evaluating SpanBERT. ☆897 · Updated last year
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020). ☆311 · Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation ☆471 · Updated last year
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987). ☆184 · Updated last year
- Code associated with the Don't Stop Pretraining ACL 2020 paper ☆529 · Updated 3 years ago
- Code for the paper "Are Sixteen Heads Really Better than One?" ☆171 · Updated 5 years ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings" ☆293 · Updated 2 years ago
- ☆344 · Updated 3 years ago
- Neural Text Generation with Unlikelihood Training ☆309 · Updated 3 years ago
- Interpretable Evaluation for AI Systems ☆364 · Updated 2 years ago
- This is a repository with the code for the ACL 2019 paper "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, t… ☆312 · Updated 3 years ago
- A research project for natural language generation, containing the official implementations by MSRA NLC team. ☆718 · Updated 9 months ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a… ☆242 · Updated 3 years ago
- ☆462 · Updated 4 years ago
- Plot the vector graph of attention based text visualisation ☆373 · Updated 6 years ago
- An original implementation of "MetaICL: Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi ☆263 · Updated 2 years ago
- ☆318 · Updated 3 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.) ☆307 · Updated 2 years ago
- TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020) ☆533 · Updated 3 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework ☆258 · Updated last year
- ☆218 · Updated 4 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis… ☆251 · Updated 3 years ago
- Adversarial Training for Natural Language Understanding ☆252 · Updated last year