UIC-Liu-Lab / CPT
[EMNLP 2022] Continual Training of Language Models for Few-Shot Learning
☆40Updated last year
Related projects: ⓘ
- [EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge☆19Updated last year
- ☆32Updated 2 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆61Updated last year
- Methods and evaluation for aligning language models temporally☆24Updated 6 months ago
- ☆49Updated last year
- ☆32Updated 5 months ago
- The code for lifelong few-shot language learning☆53Updated 2 years ago
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"☆38Updated 2 years ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆36Updated last year
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆96Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆68Updated last year
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆42Updated last year
- Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"☆41Updated 2 years ago
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models☆56Updated 7 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆51Updated last year
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆46Updated 2 years ago
- ☆26Updated last year
- ☆23Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models☆42Updated last week
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆22Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆59Updated last year
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆66Updated 6 months ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated last year
- ☆39Updated 9 months ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆19Updated last year
- ☆36Updated 5 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆45Updated 5 months ago
- Active Example Selection for In-Context Learning (EMNLP'22)☆43Updated last month
- ☆77Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆38Updated 10 months ago