VILA-Lab/GBLM-Pruner

Are gradient information useful for pruning of LLMs?

☆48

Alternatives and similar repositories for GBLM-Pruner

Users who are interested in GBLM-Pruner are comparing it to the libraries listed below.

pprp / Pruner-Zero
[ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs
☆100 · Nov 25, 2024 · Updated last year
mbzuai-nlp / x-claim
EMNLP 2023 (main): Extractive Multilingual Claim Span Identification
☆13 · Jan 22, 2024 · Updated 2 years ago
OpenGVLab / LLMPrune-BESA
BESA is a differentiable weight pruning technique for large language models.
☆17 · Mar 4, 2024 · Updated 2 years ago
fmfi-compbio / admm-pruning
☆30 · Jul 22, 2024 · Updated 2 years ago
Qualcomm-AI-research / llm-surgeon
☆35 · May 24, 2024 · Updated 2 years ago
FeiyuZhang98 / IncreLoRA
☆36 · Aug 23, 2023 · Updated 2 years ago
amandpkr / XM-GAN
[MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Cla…
☆47 · Sep 28, 2023 · Updated 2 years ago
dair-iitd / FloDial
☆12 · May 18, 2022 · Updated 4 years ago
luuyin / OWL
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
☆81 · Jul 7, 2025 · Updated last year
haiquanlu / AlphaPruning
[NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
☆34 · Jun 9, 2025 · Updated last year
biomedical-cybernetics / Relative-importance-and-activation-pruning
☆60 · Jun 10, 2024 · Updated 2 years ago
zyxxmu / DSnoT
Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…
☆50 · Apr 9, 2024 · Updated 2 years ago
yifanycc / AdaZeta
[EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…
☆13 · Dec 15, 2024 · Updated last year
declare-lab / della
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
☆37 · Jul 12, 2024 · Updated 2 years ago
acl-org / aclrollingreview
ACL Rolling Review website
☆12 · Jul 15, 2026 · Updated last week
cjyaras / deep-lora-transformers
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)
☆12 · Jul 22, 2024 · Updated 2 years ago
aaronserianni / training-free-nas
[ACL'22] Training-free Neural Architecture Search for RNNs and Transformers
☆14 · May 26, 2024 · Updated 2 years ago
IST-DASLab / DarwinLM
Official Pytorch Implementation of Paper "DarwinLM: Evolutionary Structured Pruning of Large Language Models"
☆20 · Feb 21, 2025 · Updated last year
abhinavkashyap / domadapter
Domain Adaptation and Adapters
☆16 · Feb 28, 2023 · Updated 3 years ago
imagination-research / LCSC
[ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
☆16 · Feb 15, 2025 · Updated last year
jiwonsong-dev / SLEB
[ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
☆42 · Feb 4, 2025 · Updated last year
locuslab / wanda
A simple and effective LLM pruning approach.
☆869 · Aug 9, 2024 · Updated last year
ikergarcia1996 / T-Projection
T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.
☆13 · Nov 21, 2023 · Updated 2 years ago
LinkAnonymous / BESA
☆12 · Oct 9, 2023 · Updated 2 years ago
miaozhang0525 / iDARTS
codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients
☆10 · May 27, 2021 · Updated 5 years ago
IST-DASLab / sparsegpt
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
☆890 · Aug 20, 2024 · Updated last year
amandpkr / Efficient-3D-Aware-Facial-Image-Editing
[ECCV 2024] Official code repository of paper titled "Efficient 3D-Aware Facial Image Editing Via Attribute-Specific Prompt Learning"
☆10 · Aug 2, 2024 · Updated last year
minhoooo1 / CatMAE
CatMAE
☆15 · Dec 13, 2023 · Updated 2 years ago
Lucky-Lance / SPP
[ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models
☆22 · May 28, 2024 · Updated 2 years ago
wzhuang-xmu / LoSA
[ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".
☆25 · Mar 16, 2025 · Updated last year
yifanycc / loretta
[NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
☆39 · Jan 9, 2025 · Updated last year
BaiTheBest / SparseLLM
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
☆70 · Mar 27, 2025 · Updated last year
parsa-epfl / quantization-sparsity-interplay
This repo contains the code for studying the interplay between quantization and sparsity methods
☆26 · Feb 26, 2025 · Updated last year
aim-uofa / LoRAPrune
☆63 · Dec 15, 2024 · Updated last year
mbzuai-nlp / DetectLLM
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
☆35 · Jul 26, 2023 · Updated 2 years ago
NivNayman / XNAS
☆18 · Nov 6, 2019 · Updated 6 years ago
tobiasvanderwerff / MetaHTR
Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).
☆14 · Jun 22, 2022 · Updated 4 years ago
nku-zhichengzhang / MART
[CVPR 2024] This is the official implementation of "MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Disti…
☆22 · Jun 14, 2025 · Updated last year
William-wAng618 / M2PT
Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning
☆29 · Mar 23, 2025 · Updated last year