yuchaoli / PSTLinks

Source code for IJCAI 2022 Long paper: Parameter-Efficient Sparsity for Large Language Models Fine-Tuning.

☆15

Alternatives and similar repositories for PST

Users that are interested in PST are comparing it to the libraries listed below

Sorting:

OpenGVLab / LLMPrune-BESA
BESA is a differentiable weight pruning technique for large language models.
☆17Updated last year
yxli2123 / LoSparse
☆59Updated last year
CASIA-IVA-Lab / FLAP
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models
☆61Updated last year
ziplab / QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆29Updated last year
wzhuang-xmu / LoSA
[ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".
☆21Updated 6 months ago
QingruZhang / PLATON
This pytorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).
☆46Updated 2 years ago
VITA-Group / UVC
[ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…
☆54Updated last year
HuangOwen / QAT-ACS
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆34Updated last year
kssteven418 / LTP
[KDD'22] Learned Token Pruning for Transformers
☆100Updated 2 years ago
htqin / DSG
This project is the official implementation of our accepted IEEE TPAMI paper Diverse Sample Generation: Pushing the Limit of Data-free Qu…
☆14Updated 2 years ago
MingSun-Tse / Why-the-State-of-Pruning-so-Confusing
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…
☆40Updated last week
LukasHedegaard / structured-pruning-adapters
Structured Pruning Adapters in PyTorch
☆19Updated 2 years ago
Shiweiliuiiiiiii / In-Time-Over-Parameterization
[ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De…
☆45Updated last year
VITA-Group / SViTE
[NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…
☆89Updated last year
VITA-Group / Random_Pruning
[ICLR 2022] The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training by Shiwei Liu, Tianlo…
☆76Updated 2 years ago
VILA-Lab / GBLM-Pruner
Are gradient information useful for pruning of LLMs?
☆46Updated 3 weeks ago
zyxxmu / DSnoT
Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…
☆49Updated last year
htqin / IR-QLoRA
[ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti…
☆67Updated last year
fmfi-compbio / admm-pruning
☆29Updated last year
htqin / BiBERT
This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.
☆88Updated 2 years ago
boone891214 / MEST
[NeurIPS‘2021] "MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge", Geng Yuan, Xiaolong Ma, Yanzhi Wang et al…
☆18Updated 3 years ago
MingSun-Tse / TPP
[ICLR'23] Trainability Preserving Neural Pruning (PyTorch)
☆34Updated 2 years ago
WoosukKwon / retraining-free-pruning
[NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers
☆191Updated 2 years ago
MingSun-Tse / Awesome-Pruning-at-Initialization
[IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization.
☆59Updated last year
Intelligent-Computing-Lab-Panda / TesseraQ
☆22Updated 10 months ago
VITA-Group / Random-MoE-as-Dropout
[ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…
☆55Updated 2 years ago
shawnricecake / search-llm
[NeurIPS 2024] Search for Efficient LLMs
☆15Updated 8 months ago
ModelTC / Outlier_Suppression_Plus
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…
☆46Updated last year
shoaibahmed / llm_depth_pruning
Official implementation of the paper: "A deeper look at depth pruning of LLMs"
☆15Updated last year
txsun1997 / awesome-early-exiting
A curated list of Early Exiting papers, benchmarks, and misc.
☆117Updated last year