xydaytoy / EVA
☆11Updated 9 months ago
Alternatives and similar repositories for EVA:
Users that are interested in EVA are comparing it to the libraries listed below
- ☆70Updated 3 weeks ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆22Updated last week
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆58Updated last year
- Codes for Merging Large Language Models☆27Updated 5 months ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆18Updated 2 months ago
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.☆24Updated 9 months ago
- ☆21Updated last year
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆29Updated last week
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…☆14Updated 11 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆102Updated 9 months ago
- Language Imbalance Driven Rewarding for Multilingual Self-improving☆13Updated 2 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆39Updated 2 months ago
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆18Updated last year
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆47Updated last month
- A method of ensemble learning for heterogeneous large language models.☆33Updated 5 months ago
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.☆13Updated 9 months ago
- [COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning"☆19Updated 7 months ago
- The official code repository for PRMBench.☆57Updated this week
- LoFiT: Localized Fine-tuning on LLM Representations☆30Updated last week
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)☆14Updated last month
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models".☆39Updated 2 months ago
- Official implementation of our paper "Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Opera…☆11Updated 4 months ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆38Updated last year
- Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)☆32Updated 6 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆74Updated 3 months ago
- SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆31Updated last month
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆66Updated last week
- ☆46Updated 2 months ago
- ☆28Updated last year
- 🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training☆64Updated last month