BeyonderXX / TRACE
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models
☆67Updated last year
Alternatives and similar repositories for TRACE:
Users that are interested in TRACE are comparing it to the libraries listed below
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆34Updated 3 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆141Updated last year
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆60Updated last year
- ☆50Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆107Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆41Updated 5 months ago
- ☆172Updated 9 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆38Updated 2 weeks ago
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆30Updated 4 months ago
- Directional Preference Alignment☆56Updated 6 months ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆68Updated last year
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆58Updated 4 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆39Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆66Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆74Updated 5 months ago
- ☆131Updated 8 months ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"☆33Updated last month
- ☆91Updated last month
- ☆37Updated last year
- ☆50Updated this week
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆164Updated 9 months ago
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models☆25Updated 6 months ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆100Updated 2 years ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆43Updated 6 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆74Updated 3 months ago
- ☆39Updated last year
- my commonly-used tools☆51Updated 3 months ago
- [ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"☆97Updated last year
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆133Updated 2 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆130Updated last year