yxli2123/LoSparse

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yxli2123/LoSparse)

yxli2123 / LoSparse

☆64

Alternatives and similar repositories for LoSparse

Users that are interested in LoSparse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RahulSChand / Weighted-low-rank-factorization-Pytorch
View on GitHub
PyTorch implementation of Language model compression with weighted low-rank factorization
☆14Jun 28, 2023Updated 3 years ago
luuyin / OWL
View on GitHub
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
☆81Jul 7, 2025Updated last year
VITA-Group / R-Sparse
View on GitHub
[ICLR'25] R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
☆21Apr 28, 2025Updated last year
biomedical-cybernetics / Relative-importance-and-activation-pruning
View on GitHub
☆60Jun 10, 2024Updated 2 years ago
hahnyuan / ASVD4LLM
View on GitHub
Activation-aware Singular Value Decomposition for Compressing Large Language Models
☆92Oct 22, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
syjmelody / RankE
View on GitHub
Implementation of RankE: End-to-End Discrete Text-to-Image Post-Training via Rank-Consistent Alignment
☆20May 27, 2026Updated last month
South-hw / FedPara_ICLR22
View on GitHub
☆12Dec 26, 2024Updated last year
hwang595 / Cuttlefish
View on GitHub
The implementation for MLSys 2023 paper: "Cuttlefish: Low-rank Model Training without All The Tuning"
☆44May 10, 2023Updated 3 years ago
uiuctml / Localize-and-Stitch
View on GitHub
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
☆32Feb 18, 2026Updated 5 months ago
ChengZhang-98 / LQER
View on GitHub
Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"
☆19Jul 11, 2024Updated 2 years ago
NeuraLiying / FreqExit
View on GitHub
[NeurIPS'25] FreqExit: Enabling Early-Exit Inference for Visual Autoregressive Models via Frequency-Aware Guidance
☆21Dec 15, 2025Updated 7 months ago
jiwonsong-dev / SLEB
View on GitHub
[ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
☆42Feb 4, 2025Updated last year
prateeky2806 / ComPEFT
View on GitHub
☆26Nov 23, 2023Updated 2 years ago
dropbox / low-rank-llama2
View on GitHub
Low-Rank Llama Custom Training
☆23Mar 27, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
BaiTheBest / SparseLLM
View on GitHub
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
☆70Mar 27, 2025Updated last year
chuny9743 / AI4WaterEnv
View on GitHub
Webpage: https://chuny9743.github.io/AI4WaterEnv_Webpage/
☆35Apr 14, 2026Updated 3 months ago
SempraETY / Pruning-via-Merging
View on GitHub
☆23Nov 26, 2024Updated last year
ZIB-IOL / SMS
View on GitHub
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12Oct 14, 2025Updated 9 months ago
maszhongming / ParaKnowTransfer
View on GitHub
Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"
☆33May 9, 2024Updated 2 years ago
pprp / ACBench
View on GitHub
[ICML25] Agentic Compression Benchmark (ACBench)
☆17Jul 2, 2025Updated last year
fscdc / Oracle-Pruning-Sanity-Check
View on GitHub
[TMLR 2026] Is Oracle Pruning the True Oracle?
☆33Jul 1, 2026Updated 3 weeks ago
fuzihaofzh / AnalyzeParameterEfficientFinetune
View on GitHub
On the Effectiveness of Parameter-Efficient Fine-Tuning
☆39Nov 4, 2023Updated 2 years ago
naver-aics / lut-gemm
View on GitHub
☆82Apr 1, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
QingruZhang / PLATON
View on GitHub
This pytorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).
☆45Oct 17, 2022Updated 3 years ago
fmfi-compbio / admm-pruning
View on GitHub
☆30Jul 22, 2024Updated 2 years ago
wyzjack / CNTP
View on GitHub
[ACL 2025] Cautious Next Token Prediction
☆16Jul 24, 2025Updated last year
HEAP-Lab-VT / ASIC-DEFLATE-for-memory
View on GitHub
hardware (ASIC) DEFLATE designed for low-latency page-granularity memory compression and implemented in Chisel
☆16Nov 15, 2024Updated last year
ArminAzizi98 / LaMDA
View on GitHub
☆15Nov 7, 2024Updated last year
IST-DASLab / HALO
View on GitHub
HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…
☆31Feb 17, 2025Updated last year
IST-DASLab / Sparse-Marlin
View on GitHub
Boosting 4-bit inference kernels with 2:4 Sparsity
☆96Sep 4, 2024Updated last year
KD-TAO / LVOmniBench
View on GitHub
LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs
☆41Apr 2, 2026Updated 3 months ago
StyxXuan / LoraRetriever
View on GitHub
☆17Apr 29, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
lzhangbv / acpsgd
View on GitHub
[ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning
☆10Apr 28, 2023Updated 3 years ago
yangyifei729 / LaCo
View on GitHub
Official implementation for LaCo (EMNLP 2024 Findings)
☆22Oct 3, 2024Updated last year
aaronserianni / training-free-nas
View on GitHub
[ACL'22] Training-free Neural Architecture Search for RNNs and Transformers
☆14May 26, 2024Updated 2 years ago
FMInference / DejaVu
View on GitHub
☆359Apr 2, 2024Updated 2 years ago
mingluo-su / ROSE
View on GitHub
[CPAL 2026 oral] Offical implementation of "ROSE: Reordered SparseGPT for More Accurate One-Shot Large Language Models Pruning”
☆16Apr 21, 2026Updated 3 months ago
zyxxmu / DSnoT
View on GitHub
Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…
☆50Apr 9, 2024Updated 2 years ago
horseee / Awesome-Efficient-LLM
View on GitHub
A curated list for Efficient Large Language Models
☆2,023Jun 17, 2025Updated last year