Aaquib111 / Sparse-GPT-Finetuning
Code for my ICLR 2024 TinyPapers paper "Prune and Tune: Improving Efficient Pruning Techniques for Massive Language Models"
☆13 · Updated last year
Related projects
Alternatives and complementary repositories for Sparse-GPT-Finetuning
- A testbed for various linear attention designs. ☆56 · Updated 6 months ago
- Implementation of the paper "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google, in PyTorch. ☆52 · Updated last week
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,… ☆43 · Updated 4 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models". ☆42 · Updated last week
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning (COLM 2024). ☆28 · Updated 5 months ago
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes". ☆28 · Updated 7 months ago
- [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens. ☆21 · Updated last year
- ☆50 · Updated last month
- An experiment on Dynamic NTK Scaling RoPE. ☆61 · Updated 11 months ago
- A toolkit that enhances PyTorch with specialized functions for low-bit quantized neural networks. ☆28 · Updated 4 months ago
- Code for the paper "Patch-Level Training for Large Language Models". ☆71 · Updated last week
- Script for processing OpenAI's PRM800K process-supervision dataset into an Alpaca-style instruction–response format. ☆27 · Updated last year
- ☆18 · Updated 3 months ago
- ☆64 · Updated last month
- Easy control for Key-Value Constrained Generative LLM Inference (https://arxiv.org/abs/2402.06262). ☆58 · Updated 9 months ago
- A repository based on https://github.com/jiaweizzhao/GaLore. ☆19 · Updated 2 months ago
- A Closer Look into Mixture-of-Experts in Large Language Models. ☆40 · Updated 3 months ago
- Codebase for "Instruction Following without Instruction Tuning". ☆32 · Updated last month
- Repository for sparse finetuning of LLMs via a modified version of the MosaicML llmfoundry. ☆38 · Updated 10 months ago
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation. ☆46 · Updated 4 months ago
- Code for the paper "Long cOntext aliGnment via efficient preference Optimization". ☆12 · Updated 3 weeks ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection. ☆27 · Updated 3 weeks ago
- Implementation of the model "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch. ☆29 · Updated last week
- ☆58 · Updated 5 months ago
- ☆35 · Updated 9 months ago
- ☆45 · Updated 4 months ago
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs. ☆22 · Updated last month
- EE-LLM: a framework for large-scale training and inference of early-exit (EE) large language models (LLMs). ☆49 · Updated 5 months ago
- ☆30 · Updated this week
- Repository for CPU kernel generation for LLM inference. ☆25 · Updated last year