xydaytoy / EVA
☆11Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for EVA
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆58Updated 11 months ago
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.☆22Updated 7 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆36Updated 3 weeks ago
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models☆25Updated 5 months ago
- Codes for Merging Large Language Models☆25Updated 3 months ago
- [ICLR 2024] Code for the paper "Sparse MoE with Language-Guided Routing for Multilingual Machine Translation"☆8Updated 6 months ago
- ☆48Updated this week
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆16Updated last year
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆25Updated last month
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆14Updated 3 weeks ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆36Updated this week
- ☆44Updated 10 months ago
- ☆14Updated 5 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆63Updated last year
- FeatureAlignment = Alignment + Mechanistic Interpretability☆15Updated last week
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆15Updated 6 months ago
- ☆12Updated 8 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆97Updated 7 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆24Updated 4 months ago
- ☆39Updated last month
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆52Updated last week
- ☆11Updated 6 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆38Updated last year
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆28Updated 4 months ago
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆24Updated last month
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆16Updated 2 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆86Updated 2 months ago
- [COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"☆18Updated 5 months ago
- [EMNLP 2024 Findings] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models☆19Updated last week
- ☆65Updated 2 months ago