Digitous/LLM-SLERP-Merge

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Digitous/LLM-SLERP-Merge)

Digitous / LLM-SLERP-Merge

Spherical Merge Pytorch/HF format Language Models with minimal feature loss.

☆153

Alternatives and similar repositories for LLM-SLERP-Merge

Users that are interested in LLM-SLERP-Merge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Digitous / ModelREVOLVER
View on GitHub
Model REVOLVER, a human in the loop model mixing system.
☆33Aug 2, 2023Updated 2 years ago
54rt1n / shardmerge
View on GitHub
Using fourier interpolation to merge large language models
☆11Jul 11, 2026Updated last week
TehVenomm / LM_Transformers_BlockMerge
View on GitHub
Image Diffusion block merging technique applied to transformers based Language Models.
☆55May 8, 2023Updated 3 years ago
prateeky2806 / ties-merging
View on GitHub
☆215Feb 3, 2024Updated 2 years ago
r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
View on GitHub
☆12Feb 11, 2026Updated 5 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
yule-BUAA / MergeLM
View on GitHub
Codebase for Merging Language Models (ICML 2024)
☆869May 5, 2024Updated 2 years ago
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,246Jun 17, 2026Updated last month
Gryphe / BlockMerge_Gradient
View on GitHub
Merge Transformers language models by use of gradient parameters.
☆214Aug 8, 2024Updated last year
arcee-ai / PruneMe
View on GitHub
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
☆266Apr 23, 2024Updated 2 years ago
thomasgauthier / LoRD
View on GitHub
Low-Rank adapter extraction for fine-tuned transformers models
☆181May 2, 2024Updated 2 years ago
zarakiquemparte / zaraki-tools
View on GitHub
☆28Aug 30, 2023Updated 2 years ago
Gryphe / MergeMonster
View on GitHub
An unsupervised model merging algorithm for Transformers-based language models.
☆107Apr 29, 2024Updated 2 years ago
r-three / phatgoose
View on GitHub
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆93Feb 27, 2024Updated 2 years ago
Model-GLUE / Model-GLUE
View on GitHub
☆18Aug 19, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
VITA-Group / Robust_Weight_Signatures
View on GitHub
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16May 4, 2023Updated 3 years ago
Debrup-61 / RaDeR
View on GitHub
Official Code Repositiry for "RaDeR: Reasoning-aware Dense Retrieval Models" accepted at Main Conference EMNLP 2025
☆18Jun 23, 2025Updated last year
yukyunglee / transformers-resources
View on GitHub
huggingface transformers tutorial, code, resources
☆26Apr 7, 2024Updated 2 years ago
r-three / RAD
View on GitHub
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆45Oct 1, 2025Updated 9 months ago
CarperAI / decontamination
View on GitHub
This repository contains code for cleaning your training data of benchmark data to help combat data snooping.
☆28Apr 21, 2023Updated 3 years ago
RobertCsordas / moe_attention
View on GitHub
Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"
☆101Sep 30, 2024Updated last year
lezhang7 / Retrieval_MuGI
View on GitHub
[EMNLP'2024 Findings] Explore generated documents for enhanced IR with LLMs. We enhance BM25 to surpass strong dense retriever on many da…
☆14Mar 28, 2025Updated last year
iantbutler01 / ditty
View on GitHub
A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.
☆16Jun 10, 2026Updated last month
SakanaAI / evolutionary-model-merge
View on GitHub
Official repository of Evolutionary Optimization of Model Merging Recipes
☆1,436Nov 29, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
martyn / safetensors-merge-supermario
View on GitHub
Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a…
☆83Oct 17, 2024Updated last year
tanganke / opcm
View on GitHub
official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"
☆25Oct 11, 2025Updated 9 months ago
DocShotgun / LLM-notebooks
View on GitHub
Jupyter notebooks for cloud-based usage
☆10Aug 26, 2023Updated 2 years ago
mzalaya / screenkhorn
View on GitHub
Code for NeurIPS 2019 paper "Screening Sinkhorn Algorithm for Regularized Optimal Transport"
☆10Feb 10, 2020Updated 6 years ago
vaguenebula / AlpacaDataReflect
View on GitHub
An experiment to see if chatgpt can improve the output of the stanford alpaca dataset
☆12Mar 29, 2023Updated 3 years ago
severian42 / Computational-Model-for-Symbolic-Representations
View on GitHub
Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …
☆57Feb 10, 2025Updated last year
kallewoof / lora-merge
View on GitHub
A script for merging a LLM model and a LoRA
☆13Jun 22, 2023Updated 3 years ago
mlfoundations / task_vectors
View on GitHub
Editing Models with Task Arithmetic
☆548Jan 11, 2024Updated 2 years ago
j-webtek / LLM-Learning
View on GitHub
A repository to store helpful information and emerging insights in regard to LLMs
☆21Oct 27, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
lartpang / RunIt
View on GitHub
A simple program scheduler for your code on different devices.
☆12Mar 8, 2026Updated 4 months ago
Cohere-Labs-Community / parameter-efficient-moe
View on GitHub
☆277Oct 31, 2023Updated 2 years ago
QuixiAI / laserRMT
View on GitHub
This is our own implementation of 'Layer Selective Rank Reduction'
☆240May 26, 2024Updated 2 years ago
practical-dreamer / vicuna_to_alpacan
View on GitHub
Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer
☆12Jun 21, 2023Updated 3 years ago
r-three / smear
View on GitHub
☆30Sep 28, 2023Updated 2 years ago
nbasyl / DoRA
View on GitHub
Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"
☆122Apr 28, 2024Updated 2 years ago
maj34 / Legal_Specific_KoLLM
View on GitHub
[ Text Analytics ] 법률 도메인 특화 한국어 기반 LLM 개발
☆15Sep 14, 2025Updated 10 months ago