hahahawu/Long-to-Short-via-Model-Merging

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hahahawu/Long-to-Short-via-Model-Merging)

hahahawu / Long-to-Short-via-Model-Merging

Model merging is a highly efficient approach for long-to-short reasoning.

☆103

Alternatives and similar repositories for Long-to-Short-via-Model-Merging

Users that are interested in Long-to-Short-via-Model-Merging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

starrYYxuan / UniTE
View on GitHub
☆17Nov 20, 2024Updated last year
WalkerWorldPeace / DOGE
View on GitHub
Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".
☆23May 23, 2025Updated last year
uiuctml / MergeBench
View on GitHub
[NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs
☆47Feb 11, 2026Updated 5 months ago
Hongcheng-Gao / Awesome-Long2short-on-LRMs
View on GitHub
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…
☆262Mar 7, 2026Updated 4 months ago
hemingkx / TokenSkip
View on GitHub
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆224Nov 30, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tanganke / fusion_bench
View on GitHub
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
☆236Jun 23, 2026Updated last month
wutaiqiang / MI
View on GitHub
Official code for paper "Revisiting Model Interpolation for Efficient Reasoning"
☆17Jul 14, 2026Updated 2 weeks ago
shiqichen17 / VLM_Merging
View on GitHub
Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)
☆89Jun 9, 2026Updated last month
ahnobari / ActivationInformedMerging
View on GitHub
Official repository for Activation-Informed Merging (AIM) of Large Language Models
☆24Feb 10, 2025Updated last year
emmyqin / iw_sft
View on GitHub
☆28Jul 18, 2025Updated last year
WalkerWorldPeace / MLLMerging
View on GitHub
ICLR 2026 "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".
☆57Jun 18, 2026Updated last month
EnnengYang / Efficient-WEMoE
View on GitHub
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16Oct 28, 2024Updated last year
tanganke / peta
View on GitHub
Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"
☆26Sep 13, 2024Updated last year
harveyhuang18 / EMR_Merging
View on GitHub
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆82Mar 1, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
QwenLM / ProcessBench
View on GitHub
Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"
☆190May 20, 2025Updated last year
EnnengYang / AdaMerging
View on GitHub
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆114Oct 28, 2024Updated last year
nathanielyvo / WUDI-Merging
View on GitHub
The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""
☆50Oct 1, 2025Updated 9 months ago
Zayne-sprague / To-CoT-or-not-to-CoT
View on GitHub
☆26Apr 10, 2025Updated last year
eddycmu / demystify-long-cot
View on GitHub
☆336May 31, 2025Updated last year
yuelinan / Awesome-Efficient-R1-style-LRMs
View on GitHub
☆53Jul 12, 2026Updated 2 weeks ago
hemingkx / Awesome-Efficient-Reasoning
View on GitHub
Paper list for Efficient Reasoning.
☆898May 29, 2026Updated 2 months ago
MrZilinXiao / ProxyThinker
View on GitHub
[ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.
☆22Sep 24, 2025Updated 10 months ago
AlphaLab-USTC / LRM-plans-CoT
View on GitHub
[NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"
☆31Jul 6, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
yule-BUAA / MergeLLM
View on GitHub
Codes for Merging Large Language Models
☆37Aug 7, 2024Updated last year
StarDewXXX / O1-Pruner
View on GitHub
Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
☆100Feb 21, 2025Updated last year
AIM-SKKU / ADAPT
View on GitHub
[NeurIPS 2025] Backpropagation-Free Test-Time Adaptation via Probabilistic Gaussian Alignment
☆22Mar 18, 2026Updated 4 months ago
Eclipsess / Awesome-Efficient-Reasoning-LLMs
View on GitHub
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
☆786Feb 28, 2026Updated 5 months ago
wenlinyao / HDFlow
View on GitHub
Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows
☆15Oct 4, 2024Updated last year
Liyan06 / ChartMuseum
View on GitHub
[NeurIPS 2025] ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models
☆24Apr 20, 2026Updated 3 months ago
r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
View on GitHub
☆13Feb 11, 2026Updated 5 months ago
AntoAndGar / task_singular_vectors
View on GitHub
Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.
☆57Dec 15, 2025Updated 7 months ago
KbsdJames / omni-math-rule
View on GitHub
The rule-based evaluation subset and code implementation of Omni-MATH
☆28Dec 23, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
starrYYxuan / LeCo
View on GitHub
This the implementation of LeCo
☆33Jan 20, 2025Updated last year
EvanZhuang / mixinputs
View on GitHub
Official implementation for Text Generation Beyond Discrete Token Sampling
☆26Aug 11, 2025Updated 11 months ago
gstoica27 / KnOTS
View on GitHub
Model Merging with SVD to Tie the KnOTS [ICLR 2025]
☆94Apr 3, 2025Updated last year
PRIME-RL / Entropy-Mechanism-of-RL
View on GitHub
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆446Jul 11, 2025Updated last year
OrangeInSouth / DeePEn
View on GitHub
A method of ensemble learning for heterogeneous large language models.
☆62Aug 7, 2024Updated last year
duguodong7 / pcb-merging
View on GitHub
[NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging
☆48Oct 11, 2024Updated last year
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 8 months ago