UIC-Liu-Lab / DGA
[EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge
☆21 Updated 2 years ago
Alternatives and similar repositories for DGA
Users who are interested in DGA are comparing it to the repositories listed below
- [EMNLP 2022] Continual Training of Language Models for Few-Shot Learning ☆45 Updated 2 years ago
- This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022) ☆104 Updated 3 years ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆23 Updated last year
- TBC ☆28 Updated 3 years ago
- Code for Editing Factual Knowledge in Language Models ☆142 Updated 4 years ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆70 Updated 3 years ago
- ☆23 Updated 2 years ago
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆42 Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆39 Updated 2 years ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene… ☆27 Updated 2 years ago
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe… ☆15 Updated 3 years ago
- ☆75 Updated 2 years ago
- Code for paper "CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835) ☆113 Updated 3 years ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge" ☆78 Updated 2 years ago
- The code for lifelong few-shot language learning ☆55 Updated 3 years ago
- ☆54 Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ☆98 Updated 2 years ago
- ☆83 Updated 2 years ago
- ☆48 Updated 2 years ago
- ☆88 Updated 3 years ago
- contrastive decoding ☆206 Updated 3 years ago
- Retrieval as Attention ☆82 Updated 3 years ago
- ☆55 Updated last year
- ☆35 Updated 4 years ago
- The original Backpack Language Model implementation, a fork of FlashAttention ☆71 Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning' ☆28 Updated 2 years ago
- ☆41 Updated 2 years ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”. ☆102 Updated 2 years ago
- ☆58 Updated 3 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases. ☆23 Updated last year