stephenqz/OATS

Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition

☆20

Alternatives and similar repositories for OATS

Users who are interested in OATS are comparing it to the repositories listed below.

wzhuang-xmu / LoSA
[ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".
☆25 · Mar 16, 2025 · Updated last year
fmfi-compbio / admm-pruning
☆30 · Jul 22, 2024 · Updated 2 years ago
ROIM1998 / APT
[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference
☆48 · Jun 4, 2024 · Updated 2 years ago
BaiTheBest / SparseLLM
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
☆70 · Mar 27, 2025 · Updated last year
thu-ml / Adaptive-Sparse-Trainer
Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025)
☆19 · Jul 1, 2025 · Updated last year
IST-DASLab / MicroAdam
This repository contains code for the MicroAdam paper.
☆21 · Dec 14, 2024 · Updated last year
IST-DASLab / HALO
HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…
☆31 · Feb 17, 2025 · Updated last year
IST-DASLab / EvoPress
☆43 · Jun 14, 2026 · Updated last month
luuyin / OWL
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
☆81 · Jul 7, 2025 · Updated last year
OpenGVLab / LLMPrune-BESA
BESA is a differentiable weight pruning technique for large language models.
☆17 · Mar 4, 2024 · Updated 2 years ago
jiwonsong-dev / SLEB
[ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
☆42 · Feb 4, 2025 · Updated last year
Axel-gu / DenoiseRotator
☆22 · Nov 26, 2025 · Updated 7 months ago
ZhengaoLi / DISP-LLM-Dimension-Independent-Structural-Pruning
An implementation of the DISP-LLM method from the NeurIPS 2024 paper: Dimension-Independent Structural Pruning for Large Language Models.
☆25 · Aug 6, 2025 · Updated 11 months ago
Qualcomm-AI-research / llm-surgeon
☆35 · May 24, 2024 · Updated 2 years ago
AIoT-MLSys-Lab / SVD-LLM
[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2
☆302 · Aug 28, 2025 · Updated 10 months ago
pixeli99 / MixLN
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆30 · Jul 24, 2025 · Updated last year
NYCU-EDgeAi / subspec
[NeurIPS 2025] Speculate Deep and Accurate
☆21 · Jan 16, 2026 · Updated 6 months ago
LOG-postech / rethinking-LLM-pruning
[EMNLP 2024] Official implementation of "Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimiza…
☆28 · Feb 21, 2025 · Updated last year
pprp / Awesome-LLM-Prune
Awesome list for LLM pruning.
☆297 · Oct 11, 2025 · Updated 9 months ago
rhubarbwu / neural-collapse
Generic library for neural collapse and several derivative works on the phenomenon.
☆18 · Apr 14, 2025 · Updated last year
VITA-Group / WeLore
[ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications
☆52 · Oct 30, 2025 · Updated 8 months ago
hahnyuan / ASVD4LLM
Activation-aware Singular Value Decomposition for Compressing Large Language Models
☆92 · Oct 22, 2024 · Updated last year
zyxxmu / DSnoT
Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…
☆50 · Apr 9, 2024 · Updated 2 years ago
Intelligent-Computing-Lab-Panda / TesseraQ
☆25 · Oct 31, 2024 · Updated last year
RUCKBReasoning / LLM-Streamline
Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"
☆43 · May 1, 2025 · Updated last year
SalesforceAIResearch / ThinK
ThinK: Thinner Key Cache by Query-Driven Pruning
☆30 · Jun 2, 2026 · Updated last month
andyjm3 / SLTrain
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)
☆39 · Nov 1, 2024 · Updated last year
AkideLiu / MiniCache
☆14 · Sep 7, 2024 · Updated last year
Paramathic / slim
SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs (ICML 2025)
☆37 · Nov 28, 2025 · Updated 7 months ago
LinkAnonymous / BESA
☆12 · Oct 9, 2023 · Updated 2 years ago
abhibambhaniya / progressive_gradient_flow_nm_sparsity
Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".
☆11 · Feb 5, 2024 · Updated 2 years ago
jeffreyyu0602 / quantized-training
☆35 · Dec 22, 2025 · Updated 7 months ago
thu-ml / TetraJet-MXFP4Training
Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training
☆40 · May 4, 2026 · Updated 2 months ago
k1l1 / CoCoFL
CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization
☆13 · Aug 3, 2024 · Updated last year
shawnricecake / search-llm
[NeurIPS 2024] Search for Efficient LLMs
☆16 · Jan 16, 2025 · Updated last year
XiaoyuanXie / xiaoyuanxie.github.io
Personal Page
☆12 · Jul 4, 2026 · Updated 2 weeks ago
biomedical-cybernetics / Relative-importance-and-activation-pruning
☆60 · Jun 10, 2024 · Updated 2 years ago
LKJacky / Differentiable-Model-Scaling
This is the official repo for "Differentiable Model Scaling using Differentiable Topk"
☆12 · May 16, 2024 · Updated 2 years ago
yuxwind / CBS
Official Code of The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks [ICML 2022]
☆16 · Sep 20, 2022 · Updated 3 years ago