arcee-ai/PruneMe

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/arcee-ai/PruneMe)

arcee-ai / PruneMe

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

☆267

Alternatives and similar repositories for PruneMe

Users that are interested in PruneMe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sramshetty / ShortGPT
View on GitHub
Unofficial implementations of block/layer-wise pruning methods for LLMs.
☆78Apr 29, 2024Updated 2 years ago
QuixiAI / laserRMT
View on GitHub
This is our own implementation of 'Layer Selective Rank Reduction'
☆240May 26, 2024Updated 2 years ago
arcee-ai / DistillKit
View on GitHub
An Open Source Toolkit For LLM Distillation
☆990May 12, 2026Updated 2 months ago
MrGGLS / BlockPruner
View on GitHub
A block pruning framework for LLMs.
☆28May 17, 2025Updated last year
jiwonsong-dev / SLEB
View on GitHub
[ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
☆42Feb 4, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Digitous / ModelREVOLVER
View on GitHub
Model REVOLVER, a human in the loop model mixing system.
☆33Aug 2, 2023Updated 2 years ago
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,261Jun 17, 2026Updated last month
horseee / LLM-Pruner
View on GitHub
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baich…
☆1,133Oct 7, 2024Updated last year
QuixiAI / spectrum
View on GitHub
☆145Aug 20, 2025Updated 11 months ago
locuslab / wanda
View on GitHub
A simple and effective LLM pruning approach.
☆868Aug 9, 2024Updated last year
fangyuan-ksgk / Evolutionary-Model-Merge
View on GitHub
Unofficial Implementation of Evolutionary Model Merging
☆42Mar 28, 2024Updated 2 years ago
arcee-ai / EvolKit
View on GitHub
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆258Oct 30, 2024Updated last year
melisa-writer / short-transformers
View on GitHub
Prune transformer layers
☆74May 30, 2024Updated 2 years ago
Digitous / LLM-SLERP-Merge
View on GitHub
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆153Sep 10, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
VITA-Group / llm-kick
View on GitHub
[ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing llms: The truth is rarely pure and never simple.
☆27Apr 21, 2025Updated last year
thomasgauthier / LoRD
View on GitHub
Low-Rank adapter extraction for fine-tuned transformers models
☆181May 2, 2024Updated 2 years ago
Gryphe / MergeMonster
View on GitHub
An unsupervised model merging algorithm for Transformers-based language models.
☆107Apr 29, 2024Updated 2 years ago
fmfi-compbio / admm-pruning
View on GitHub
☆30Jul 22, 2024Updated 2 years ago
l4b4r4b4b4 / AIDocks
View on GitHub
LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT
☆27Feb 18, 2024Updated 2 years ago
arcee-ai / DAM
View on GitHub
☆56Nov 6, 2024Updated last year
dphnAI / sonar
View on GitHub
Large-scale LLM inference engine
☆1,810Updated this week
jukofyork / transplant-vocab
View on GitHub
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆54Oct 29, 2025Updated 8 months ago
Pleias / Various-Finetuning
View on GitHub
Set of scripts to finetune LLMs
☆38Mar 30, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zju-vipa / training_free_model_merging
View on GitHub
This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).
☆34Mar 5, 2024Updated 2 years ago
RichardKelley / hflm
View on GitHub
A simple library for working with Hugging Face models.
☆14Dec 30, 2024Updated last year
Leeroo-AI / mergoo
View on GitHub
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
☆518Aug 26, 2024Updated last year
microsoft / TransformerCompression
View on GitHub
For releasing code related to compression methods for transformers, accompanying our publications
☆461Jan 16, 2025Updated last year
princeton-nlp / LLM-Shearing
View on GitHub
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆643Mar 4, 2024Updated 2 years ago
zarakiquemparte / zaraki-tools
View on GitHub
☆28Aug 30, 2023Updated 2 years ago
davidkim205 / translation
View on GitHub
☆13Apr 17, 2024Updated 2 years ago
Aratako / Task-Vector-Merge-Optimzier
View on GitHub
☆16Apr 11, 2024Updated 2 years ago
huggingface / llm-swarm
View on GitHub
Manage scalable open LLM inference endpoints in Slurm clusters
☆289Jul 11, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
flowritecom / flow-merge
View on GitHub
flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popula…
☆21Feb 12, 2025Updated last year
SempraETY / Pruning-via-Merging
View on GitHub
☆23Nov 26, 2024Updated last year
LostRuins / datasetexplorer
View on GitHub
Easily view and modify JSON datasets for large language models
☆90May 16, 2025Updated last year
FailSpy / abliterator
View on GitHub
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
☆678Jun 11, 2024Updated 2 years ago
jukofyork / control-vectors
View on GitHub
Genertaes control vectors for use with llama.cpp in GGUF format.
☆48Mar 19, 2025Updated last year
yule-BUAA / MergeLM
View on GitHub
Codebase for Merging Language Models (ICML 2024)
☆870May 5, 2024Updated 2 years ago
54rt1n / shardmerge
View on GitHub
Using fourier interpolation to merge large language models
☆11Jul 11, 2026Updated 2 weeks ago