Code for the ACL 2024 paper "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language Models"
☆39Jan 13, 2025Updated last year
Alternatives and similar repositories for SAPT
Users that are interested in SAPT are comparing it to the libraries listed below.
- Code for the paper: Rehearsal-free Continual Language Learning via Efficient Parameter Isolation☆12May 16, 2023Updated 2 years ago
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models☆26Oct 8, 2024Updated last year
- ☆199Jul 13, 2024Updated last year
- [ACL2024] A Codebase for Incremental Learning with Large Language Models; Official released code for "Learn or Recall? Revisiting Increme…☆60Feb 1, 2025Updated last year
- Code for "Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal" (ACL 2024)☆16Oct 21, 2024Updated last year
- Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"☆11Oct 15, 2024Updated last year
- This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)☆70May 30, 2025Updated 10 months ago
- ☆29Apr 7, 2024Updated 2 years ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- An implementation of SEAL: Safety-Enhanced Aligned LLM fine-tuning via bilevel data selection.☆24Feb 20, 2025Updated last year
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models☆92Jan 24, 2024Updated 2 years ago
- [NeurIPS 2024] VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections☆21Oct 15, 2024Updated last year
- A Comprehensive Survey on Continual Learning in Generative Models.☆141Feb 14, 2026Updated last month
- [ICLR 2025] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts☆29Oct 9, 2025Updated 6 months ago
- Data for evaluating GPT-4V☆11Oct 26, 2023Updated 2 years ago
- [ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Model…☆161May 30, 2025Updated 10 months ago
- Progressive Prompts: Continual Learning for Language Models☆96Apr 24, 2023Updated 2 years ago
- ☆14Jan 10, 2025Updated last year
- [CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey☆536Dec 23, 2025Updated 3 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆149Nov 14, 2023Updated 2 years ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆403Apr 29, 2024Updated last year
- AAAI 2022 paper - Unifying Model Explainability and Robustness for Joint Text Classification and Rationale Extraction☆17Dec 23, 2021Updated 4 years ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆179Jan 29, 2026Updated 2 months ago
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"☆52Nov 10, 2024Updated last year
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction"☆41Apr 3, 2025Updated last year
- EraseAnything, ICML 2025☆40Sep 28, 2025Updated 6 months ago
- Pytorch Implementation for "Preserving Linear Separability in Continual Learning by Backward Feature Projection" (CVPR 2023)☆18Jun 29, 2023Updated 2 years ago
- ☆11Jul 4, 2024Updated last year
- code and resources for our paper "Achieving Joint Training Accuracy in Continual Learning" in AAAI2025☆14Feb 25, 2025Updated last year
- MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks (ICML 2023)☆11Aug 15, 2023Updated 2 years ago
- ☆29Sep 16, 2025Updated 6 months ago
- An Extensible Continual Learning Framework Focused on Language Models (LMs)☆292Jan 28, 2024Updated 2 years ago
- [ICML 2024] Self-Infilling Code Generation☆18May 5, 2024Updated last year
- Continual Learning for Task-Oriented Dialogue Systems☆30Apr 21, 2022Updated 3 years ago
- ☆20Feb 2, 2026Updated 2 months ago
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- Documentation at☆14Mar 27, 2025Updated last year
- Code for MERL's ECCV 2022 paper on Cross-Modal Knowledge Transfer Without Task-Relevant Source Data☆10Jul 19, 2022Updated 3 years ago