CASIA-LMC-Lab/FLAP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CASIA-LMC-Lab/FLAP)

CASIA-LMC-Lab / FLAP

[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models

☆76

Alternatives and similar repositories for FLAP

Users that are interested in FLAP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LinkAnonymous / BESA
View on GitHub
☆12Oct 9, 2023Updated 2 years ago
VITA-Group / llm-kick
View on GitHub
[ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing llms: The truth is rarely pure and never simple.
☆27Apr 21, 2025Updated last year
jiwonsong-dev / SLEB
View on GitHub
[ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
☆41Feb 4, 2025Updated last year
yangyifei729 / LaCo
View on GitHub
Official implementation for LaCo (EMNLP 2024 Findings)
☆22Oct 3, 2024Updated last year
snu-mllab / LayerMerge
View on GitHub
Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)
☆31Apr 13, 2026Updated 3 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
horseee / LLM-Pruner
View on GitHub
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baich…
☆1,130Oct 7, 2024Updated last year
BaiTheBest / SparseLLM
View on GitHub
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
☆70Mar 27, 2025Updated last year
luuyin / OWL
View on GitHub
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
☆81Jul 7, 2025Updated last year
IST-DASLab / sparsegpt
View on GitHub
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
☆889Aug 20, 2024Updated last year
pprp / Awesome-LLM-Prune
View on GitHub
Awesome list for LLM pruning.
☆297Oct 11, 2025Updated 9 months ago
WoosukKwon / retraining-free-pruning
View on GitHub
[NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers
☆197Feb 28, 2023Updated 3 years ago
SempraETY / Pruning-via-Merging
View on GitHub
☆23Nov 26, 2024Updated last year
pprp / Pruner-Zero
View on GitHub
[ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs
☆100Nov 25, 2024Updated last year
locuslab / wanda
View on GitHub
A simple and effective LLM pruning approach.
☆868Aug 9, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ylsung / ECoFLaP
View on GitHub
Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)
☆21Feb 16, 2024Updated 2 years ago
princeton-nlp / LLM-Shearing
View on GitHub
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆640Mar 4, 2024Updated 2 years ago
IST-DASLab / EvoPress
View on GitHub
☆43Jun 14, 2026Updated last month
GATECH-EIC / AmoebaLLM
View on GitHub
[NeurIPS 2024] "AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment" by Yonggan Fu, Zhongzhi Yu,…
☆19Dec 13, 2024Updated last year
OpenGVLab / LLMPrune-BESA
View on GitHub
BESA is a differentiable weight pruning technique for large language models.
☆17Mar 4, 2024Updated 2 years ago
ROIM1998 / APT
View on GitHub
[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference
☆48Jun 4, 2024Updated 2 years ago
microsoft / geta
View on GitHub
[CVPR 2025] Official repository for GETA
☆43Nov 5, 2025Updated 8 months ago
fmfi-compbio / admm-pruning
View on GitHub
☆30Jul 22, 2024Updated last year
abdelfattah-lab / shadow_llm
View on GitHub
☆11Sep 20, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
lippman1125 / channel_pruning_lasso
View on GitHub
channel pruning for accelerating very deep neural networks
☆13Mar 8, 2021Updated 5 years ago
Nota-NetsPresso / SNP
View on GitHub
Structured Neuron Level Pruning to compress Transformer-based models [ECCV'24]
☆16Aug 7, 2024Updated last year
sramshetty / ShortGPT
View on GitHub
Unofficial implementations of block/layer-wise pruning methods for LLMs.
☆78Apr 29, 2024Updated 2 years ago
NVlabs / MaskLLM
View on GitHub
[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models
☆189Jan 1, 2025Updated last year
xingyizhao / PURE
View on GitHub
Code associated with ICML (2024). "Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normaliz…
☆11Feb 22, 2026Updated 4 months ago
biomedical-cybernetics / Relative-importance-and-activation-pruning
View on GitHub
☆60Jun 10, 2024Updated 2 years ago
kkeono2 / Channel-Pruning-using-Thinet-LASSO-
View on GitHub
☆14Dec 4, 2020Updated 5 years ago
imagination-research / LCSC
View on GitHub
[ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
☆16Feb 15, 2025Updated last year
IST-DASLab / DarwinLM
View on GitHub
Official Pytorch Implementation of Paper "DarwinLM: Evolutionary Structured Pruning of Large Language Models"
☆20Feb 21, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
alexandertheus / Intra-Fusion
View on GitHub
Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)
☆18Dec 5, 2024Updated last year
parsa-epfl / quantization-sparsity-interplay
View on GitHub
This repo contains the code for studying the interplay between quantization and sparsity methods
☆26Feb 26, 2025Updated last year
microsoft / TransformerCompression
View on GitHub
For releasing code related to compression methods for transformers, accompanying our publications
☆461Jan 16, 2025Updated last year
zyxxmu / Bi-Mask
View on GitHub
Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"
☆13Jun 7, 2023Updated 3 years ago
shawnricecake / search-llm
View on GitHub
[NeurIPS 2024] Search for Efficient LLMs
☆16Jan 16, 2025Updated last year
hdong920 / GRIFFIN
View on GitHub
☆40Aug 27, 2024Updated last year
NayMyatMin / CROW
View on GitHub
Internal Consistency Regularization (CROW) for LLM Backdoor Elimination - Paper accepted to ICML 2025
☆16May 6, 2025Updated last year