Lucky-Lance/SPP

[ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models

☆22

Alternatives and similar repositories for SPP

Users who are interested in SPP are comparing it to the repositories listed below.

JingXuTHU / Random-Masking-Finds-Winning-Tickets-for-Parameter-Efficient-Fine-tuning
☆14 · May 4, 2024 · Updated 2 years ago
Lucky-Lance / TerDiT
TerDiT: Ternary Diffusion Models with Transformers
☆76 · Jun 17, 2024 · Updated 2 years ago
SalesforceAIResearch / ThinK
ThinK: Thinner Key Cache by Query-Driven Pruning
☆30 · Jun 2, 2026 · Updated last month
mathllm / Step-Controlled_DPO
☆23 · Jul 5, 2024 · Updated 2 years ago
Qualcomm-AI-research / llm-surgeon
☆35 · May 24, 2024 · Updated 2 years ago
Lucky-Lance / Expert_Sparsity
[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
☆123 · May 24, 2024 · Updated 2 years ago
shaoyiHusky / SparseProgressiveDistillation
☆12 · Aug 22, 2023 · Updated 2 years ago
GATECH-EIC / Linearized-LLM
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35 · Jun 12, 2024 · Updated 2 years ago
UtkarshSaxena1 / EigenAttn
☆20 · Oct 13, 2024 · Updated last year
BradMcDanel / sdgp
☆10 · Feb 1, 2022 · Updated 4 years ago
rockywind / ADD
☆11 · Nov 21, 2022 · Updated 3 years ago
UCSB-AI / SafeKey
[EMNLP 2025] Official code for the paper "SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning"
☆16 · May 12, 2026 · Updated 2 months ago
pixeli99 / MixLN
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆30 · Jul 24, 2025 · Updated last year
UCSC-VLAA / Complex-Edit
Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
☆29 · Apr 22, 2025 · Updated last year
Cornell-RelaxML / Hyperdimensional-Computing
Official implementation for the paper "Understanding Hyperdimensional Computing for Parallel Single-Pass Learning"
☆25 · Jun 10, 2023 · Updated 3 years ago
RUCKBReasoning / LLM-Streamline
Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"
☆43 · May 1, 2025 · Updated last year
InternScience / TrustGeoGen
Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"
☆23 · Sep 1, 2025 · Updated 10 months ago
IST-DASLab / EvoPress
☆43 · Jun 14, 2026 · Updated last month
HPC-Fortran2CPP / OpenMP-Fortran-CPP-Translation
This repo contains the dataset for paper: Creating a Dataset Supporting Translation Between OpenMP Fortran and C++ Code
☆15 · Dec 1, 2023 · Updated 2 years ago
MarkXCloud / CSpD
The official repo of continuous speculative decoding
☆36 · Mar 28, 2025 · Updated last year
ruz048 / AutoLoRA
☆10 · Apr 16, 2024 · Updated 2 years ago
zyxxmu / DSnoT
Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…
☆50 · Apr 9, 2024 · Updated 2 years ago
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆20 · May 24, 2024 · Updated 2 years ago
AminKaramlou / QNLG
Contains the codebase for Quantum Natural Language Generation project
☆23 · Nov 2, 2022 · Updated 3 years ago
HIT-SIRS / CroBIM
☆23 · Nov 29, 2024 · Updated last year
SawyDust1228 / HSIC-DKL-Yield-Estimation
[ASPDAC23] High Dimensional Yield Estimation using Shrinkage Deep Features and Maximization of Integral Entropy Reduction
☆14 · Oct 9, 2022 · Updated 3 years ago
Olivia-fsm / DoGE
Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"
☆21 · Feb 29, 2024 · Updated 2 years ago
IST-DASLab / Sparse-Marlin
Boosting 4-bit inference kernels with 2:4 Sparsity
☆96 · Sep 4, 2024 · Updated last year
BaohaoLiao / frac-cot
[COLM 2026] An efficient 3D sampling method for long-CoT LLM.
☆16 · May 25, 2025 · Updated last year
UCSC-VLAA / CLIPS
An Enhanced CLIP Framework for Learning with Synthetic Captions
☆40 · Apr 18, 2025 · Updated last year
asalarpour / Point_GN
Official WACV 2025 code for Point-GN: A non-parametric, training-free method for 3D point cloud classification using Gaussian Positional …
☆15 · Jul 22, 2025 · Updated last year
OmniMMI / M4
[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
☆18 · Apr 2, 2025 · Updated last year
spcl / sten
Sparsity support for PyTorch
☆39 · Mar 22, 2025 · Updated last year
yokoxue / LpDL
The codes are for the paper: "Complete Dictionary Learning via \ell_p-norm Maximization", Yifei Shen∗, Ye Xue∗, Jun Zhang, Khaled B. …
☆11 · Nov 21, 2020 · Updated 5 years ago
sai-prasanna / bert-experiments
☆19 · Oct 6, 2020 · Updated 5 years ago
WJ-Chang-42 / ASTransformer
☆17 · Oct 21, 2021 · Updated 4 years ago
ZrrSkywalker / MonoDETR-MV
The multi-view version of MonoDETR on nuScenes dataset
☆21 · Nov 4, 2022 · Updated 3 years ago
declare-lab / della
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
☆37 · Jul 12, 2024 · Updated 2 years ago
roymiles / VeLoRA
[NeurIPS 2024] VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections
☆22 · Oct 15, 2024 · Updated last year