UIC-Liu-Lab / CPT
[EMNLP 2022] Continual Training of Language Models for Few-Shot Learning
☆44Updated 2 years ago
Alternatives and similar repositories for CPT:
Users who are interested in CPT compare it with the repositories listed below
- [EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge☆21Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting☆32Updated 2 years ago
- This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆100Updated 2 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆69Updated last year
- ☆25Updated last year
- ☆52Updated last year
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆46Updated 3 years ago
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models☆64Updated last year
- ☆32Updated 2 years ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated 9 months ago
- The code for lifelong few-shot language learning☆55Updated 3 years ago
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…☆15Updated 2 years ago
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"☆38Updated 2 years ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆38Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆55Updated 7 months ago
- Methods and evaluation for aligning language models temporally☆27Updated 11 months ago
- ☆40Updated last year
- Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"☆44Updated 2 years ago
- DEMix Layers for Modular Language Modeling☆53Updated 3 years ago
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆38Updated 8 months ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆22Updated 6 months ago
- Residual Prompt Tuning: a method for faster and better prompt tuning.☆52Updated last year
- ☆47Updated 10 months ago
- ☆85Updated 2 years ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆58Updated 3 months ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆74Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆43Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆66Updated 2 years ago
- Code for the paper: "Continual Learning for Text Classification with Information Disentanglement Based Regularization"☆44Updated 2 years ago