llm-merging / LLM-Merging
LLM-Merging: Building LLMs Efficiently through Merging
☆197 Updated 7 months ago
Alternatives and similar repositories for LLM-Merging
Users who are interested in LLM-Merging are comparing it to the repositories listed below.
- Function Vectors in Large Language Models (ICLR 2024) ☆166 Updated 3 weeks ago
- ☆177 Updated last year
- ☆97 Updated 10 months ago
- ☆171 Updated 3 weeks ago
- ☆94 Updated last year
- Code for Zero-Shot Tokenizer Transfer ☆127 Updated 4 months ago
- Code accompanying the paper "Massive Activations in Large Language Models" ☆161 Updated last year
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind ☆176 Updated 8 months ago
- ☆120 Updated 7 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆105 Updated last year
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆155 Updated 6 months ago
- AI Logging for Interpretability and Explainability🔬 ☆116 Updated 11 months ago
- Reproducible, flexible LLM evaluations ☆200 Updated last week
- ☆177 Updated last year
- PASTA: Post-hoc Attention Steering for LLMs ☆117 Updated 5 months ago
- AnchorAttention: Improved attention for LLMs long-context training ☆207 Updated 4 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers ☆68 Updated 3 months ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality ☆193 Updated 9 months ago
- ☆165 Updated last month
- A brief and partial summary of RLHF algorithms. ☆128 Updated 2 months ago
- The HELMET Benchmark ☆143 Updated 3 weeks ago
- PyTorch library for Active Fine-Tuning ☆72 Updated 2 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆109 Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆72 Updated 8 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs. ☆411 Updated last year
- Direct Preference Optimization from scratch in PyTorch ☆91 Updated last month
- This is the official repository for Inheritune. ☆111 Updated 3 months ago
- ☆112 Updated 5 months ago
- ☆72 Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering ☆172 Updated 3 months ago