ZhengaoLi/DISP-LLM-Dimension-Independent-Structural-Pruning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZhengaoLi/DISP-LLM-Dimension-Independent-Structural-Pruning)

ZhengaoLi / DISP-LLM-Dimension-Independent-Structural-Pruning

An implementation of the DISP-LLM method from the NeurIPS 2024 paper: Dimension-Independent Structural Pruning for Large Language Models.

☆25

Alternatives and similar repositories for DISP-LLM-Dimension-Independent-Structural-Pruning

Users that are interested in DISP-LLM-Dimension-Independent-Structural-Pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rezashkv / diffusion_pruning
View on GitHub
[ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.
☆15Feb 1, 2025Updated last year
LinkAnonymous / BESA
View on GitHub
☆12Oct 9, 2023Updated 2 years ago
shawnricecake / search-llm
View on GitHub
[NeurIPS 2024] Search for Efficient LLMs
☆16Jan 16, 2025Updated last year
biomedical-cybernetics / Relative-importance-and-activation-pruning
View on GitHub
☆60Jun 10, 2024Updated 2 years ago
fmfi-compbio / admm-pruning
View on GitHub
☆30Jul 22, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
stephenqz / OATS
View on GitHub
Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition
☆20Apr 16, 2025Updated last year
gccnlp / Light-PEFT
View on GitHub
[ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
☆13Sep 2, 2024Updated last year
flash-bon / flash-bon
View on GitHub
(ECCV 2026): Official code for Flash-BoN: Instant Drafts for Inference-Time Scaling in Diffusion Models
☆18Jul 9, 2026Updated 3 weeks ago
Qualcomm-AI-research / llm-surgeon
View on GitHub
☆35May 24, 2024Updated 2 years ago
DZY122 / DiTAS
View on GitHub
DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing (WACV 2025)
☆13Feb 7, 2026Updated 5 months ago
CASIA-LMC-Lab / FLAP
View on GitHub
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models
☆76Jan 6, 2024Updated 2 years ago
LukasHedegaard / structured-pruning-adapters
View on GitHub
Structured Pruning Adapters in PyTorch
☆19Aug 30, 2023Updated 2 years ago
29DCH / AI_ML_DataAnalysis_DataVisualization_Classic-Examples
View on GitHub
关于AI,ML,DA,DV等的几个经典案例，包括堵车模拟(NagelSchreckenberg)、蒙特卡洛排队问题(Monte Carlo Queuing Problem)、人脸识别(RecognitionFace)、遗传算法推断图像(IconGenetic)
☆10Oct 14, 2018Updated 7 years ago
abhibambhaniya / progressive_gradient_flow_nm_sparsity
View on GitHub
Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".
☆11Feb 5, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kridgeway / f-statistic-loss-nips-2018
View on GitHub
Learning Deep Disentangled Embeddings with the F-Statistic Loss (NIPS 2018)
☆10Oct 17, 2018Updated 7 years ago
yifanycc / AdaZeta
View on GitHub
[EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…
☆13Dec 15, 2024Updated last year
chuliang007 / resnet20_training
View on GitHub
☆11Aug 2, 2024Updated last year
SimengSun / ChapterBreak
View on GitHub
☆12Jun 5, 2024Updated 2 years ago
abdelfattah-lab / BRAMAC
View on GitHub
☆10Nov 27, 2024Updated last year
VITA-Group / SEAL
View on GitHub
[COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free
☆62Apr 6, 2025Updated last year
yuxwind / CBS
View on GitHub
Official Code of The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks[ICML2022]
☆16Sep 20, 2022Updated 3 years ago
airaria / GRAIN
View on GitHub
GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models
☆19Jul 12, 2023Updated 3 years ago
Dejiao2018 / GrOWL
View on GitHub
Learning to share: simultaneous parameter tying and sparsification in deep learning
☆13Aug 21, 2018Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
casenoone / vortex-particles-method-2d
View on GitHub
vortex particles for simulating smoke in 2d
☆17Dec 13, 2021Updated 4 years ago
IST-DASLab / MicroAdam
View on GitHub
This repository contains code for the MicroAdam paper.
☆21Dec 14, 2024Updated last year
ruchikachavhan / concept-prune
View on GitHub
Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning
☆24Aug 13, 2024Updated last year
Zengwh02 / GlimpRouter
View on GitHub
GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts
☆16Apr 24, 2026Updated 3 months ago
Alrightlone / OBS-Diff
View on GitHub
[ICLR 2026] Offical implementation of "OBS-Diff".
☆66Mar 5, 2026Updated 4 months ago
twinkle0331 / Xcompression
View on GitHub
[ICLR 2022] Code for paper "Exploring Extreme Parameter Compression for Pre-trained Language Models"(https://arxiv.org/abs/2205.10036)
☆22May 24, 2023Updated 3 years ago
SempraETY / Pruning-via-Merging
View on GitHub
☆23Nov 26, 2024Updated last year
Hyaloid / AccSpMM
View on GitHub
Official implementation of Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplication with GPU Tensor Cores.
☆17Nov 13, 2025Updated 8 months ago
allenai / hyperdecoders
View on GitHub
Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304
☆14Oct 11, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
nolfwin / symloss-ber-auc
View on GitHub
Code for the paper: On Symmetric Losses for Learning from Corrupted Labels
☆19May 11, 2019Updated 7 years ago
IBM / ZOSVRG-BlackBox-Adv
View on GitHub
ZOSVRG-BlackBox-Adv
☆12Oct 30, 2018Updated 7 years ago
gmum / Zero-Time-Waste
View on GitHub
☆15Dec 19, 2023Updated 2 years ago
Mxbonn / ltmp
View on GitHub
Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…
☆17Nov 24, 2024Updated last year
theyoungkwon / TinyTrain
View on GitHub
The official implementation of TinyTrain [ICML '24]
☆28Jul 19, 2024Updated 2 years ago
luuyin / OWL
View on GitHub
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
☆82Jul 7, 2025Updated last year
JeremieMelo / L2ight
View on GitHub
☆26Nov 10, 2021Updated 4 years ago