Exploring Model Kinship for Merging Large Language Models
☆27 · Apr 16, 2025 · Updated 10 months ago
Alternatives and similar repositories for ModelKinship
Users who are interested in ModelKinship are comparing it to the repositories listed below.
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion" ☆14 · Mar 28, 2024 · Updated last year
- ☆12 · Jul 30, 2025 · Updated 7 months ago
- Fine-tuning Quantized Neural Networks with Zeroth-order Optimization ☆16 · Sep 17, 2025 · Updated 5 months ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation ☆13 · Sep 2, 2024 · Updated last year
- ☆24 · Jan 6, 2026 · Updated last month
- Code for "Automatic Circuit Finding and Faithfulness" ☆17 · Jul 11, 2024 · Updated last year
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. arXiv, 2024. ☆16 · Oct 28, 2024 · Updated last year
- ☆18 · Mar 30, 2025 · Updated 11 months ago
- Official PyTorch implementation of our paper accepted at ICLR 2024: Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM… ☆50 · Apr 9, 2024 · Updated last year
- ☆19 · Jan 3, 2025 · Updated last year
- Official implementation of "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging". ☆43 · Oct 30, 2025 · Updated 4 months ago
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models ☆22 · Jun 13, 2024 · Updated last year
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?" ☆64 · Dec 10, 2025 · Updated 2 months ago
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc… ☆31 · Jan 28, 2026 · Updated last month
- ☆33 · Jul 8, 2024 · Updated last year
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable". ☆29 · Mar 11, 2025 · Updated 11 months ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization" ☆24 · Sep 13, 2024 · Updated last year
- ☆31 · Mar 27, 2025 · Updated 11 months ago
- ☆58 · Oct 4, 2025 · Updated 4 months ago
- ☆37 · Jan 26, 2024 · Updated 2 years ago
- Repository of IPBench ☆19 · Jan 4, 2026 · Updated last month
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes" ☆30 · Mar 28, 2024 · Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling ☆36 · Jul 12, 2024 · Updated last year
- ☆41 · Jan 28, 2026 · Updated last month
- GBM implementation on Legate ☆14 · Jan 28, 2026 · Updated last month
- ☆12 · Jun 17, 2025 · Updated 8 months ago
- [ACL 2025] Official code for "Learning to Reason from Feedback at Test-Time". ☆13 · May 16, 2025 · Updated 9 months ago
- ☆11 · Jul 17, 2023 · Updated 2 years ago
- Evolutionary Multi-objective Optimization based Neural Architecture Search for Cognitive Diagnosis ☆12 · Sep 5, 2024 · Updated last year
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms" ☆23 · Oct 23, 2025 · Updated 4 months ago
- Unofficial Implementation of Evolutionary Model Merging ☆41 · Mar 28, 2024 · Updated last year
- ☆104 · Oct 30, 2023 · Updated 2 years ago
- opentqa is an open framework for textbook question answering that includes xtqa, mcan, cmr, mfb, and mutan. ☆11 · Mar 27, 2021 · Updated 4 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024) ☆16 · Sep 28, 2024 · Updated last year
- ☆13 · Mar 2, 2025 · Updated last year
- [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models ☆11 · Sep 19, 2025 · Updated 5 months ago
- JAX Scalify: end-to-end scaled arithmetics ☆18 · Oct 30, 2024 · Updated last year
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models" ☆13 · Jun 1, 2024 · Updated last year
- 🎉 TrustJudge is accepted to ICLR 2026! ☆38 · Sep 27, 2025 · Updated 5 months ago