haiquanlu/AlphaPruning

[NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models

☆34

Alternatives and similar repositories for AlphaPruning

Users who are interested in AlphaPruning are comparing it to the libraries listed below.

peijunallin / alphalora
☆19 · Nov 10, 2024 · Updated last year
nsfzyzz / loss_landscape_taxonomy
[NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228
☆20 · Jan 7, 2022 · Updated 4 years ago
nsfzyzz / Generalization_metrics_for_NLP
[KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…
☆12 · Oct 17, 2022 · Updated 3 years ago
haiquanlu / Mix-Quant
☆37 · May 21, 2026 · Updated 2 months ago
zhengyuan-xie / ECCV24_NeST
[ECCV 2024] Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation
☆39 · Mar 3, 2025 · Updated last year
bigglesworthnotacat / LLM-Steg
[ICLR 2026 Oral] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography
☆20 · Mar 22, 2026 · Updated 4 months ago
YinBo0927 / RePro
The official code of Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior
☆22 · Jan 6, 2026 · Updated 6 months ago
Henrymachiyu / FIPO
This code implements the algorithm of FIPO, a value-free RL recipe for eliciting deeper reasoning from a clean base model.
☆18 · Jul 14, 2026 · Updated last week
amiya-special / AutoMIA
☆15 · Apr 3, 2026 · Updated 3 months ago
Lexiang-Xiong / CAD
[ECCV 2026] Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models
☆28 · Jun 20, 2026 · Updated last month
Yujun-Yan / Neural-Execution-Engines
Code for Neural Execution Engines: Learning to Execute Subroutines
☆18 · Jan 11, 2021 · Updated 5 years ago
SempraETY / Pruning-via-Merging
☆23 · Nov 26, 2024 · Updated last year
tsa18 / ConciseHint
[Preprint arXiv:2506.18810] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation
☆26 · Oct 1, 2025 · Updated 9 months ago
yangyifei729 / LaCo
Official implementation for LaCo (EMNLP 2024 Findings)
☆22 · Oct 3, 2024 · Updated last year
zhangchbin / awesome-continual-segmentation
This repo is a collection of AWESOME things about continual semantic segmentation, including papers, code, demos, etc. Feel free to pull …
☆30 · Aug 21, 2024 · Updated last year
fscdc / dVoting
[arXiv 2026] dVoting: Fast Voting for dLLMs
☆30 · Feb 13, 2026 · Updated 5 months ago
Lexie-YU / ViFeEdit
[Preprint] ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer
☆67 · Mar 31, 2026 · Updated 3 months ago
IST-DASLab / EvoPress
☆43 · Jun 14, 2026 · Updated last month
A-suozhang / MixDQ
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
☆14 · Nov 27, 2024 · Updated last year
caojiaolong / Awesome-Mamba
Collect papers about Mamba (a selective state space model).
☆15 · Aug 6, 2024 · Updated last year
FishAndWasabi / Real-LOD
Official implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"
☆34 · Apr 20, 2025 · Updated last year
imagination-research / LCSC
[ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
☆16 · Feb 15, 2025 · Updated last year
HUST-AI-HYZ / FARMS
Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias
☆45 · Nov 14, 2025 · Updated 8 months ago
Visual-AI / v-CLR
[CVPR 2025 Highlight] v-CLR: View-Consistent Learning for Open-World Instance Segmentation
☆21 · May 31, 2026 · Updated last month
IBM / AutoVP
[ICLR24] "AutoVP: An Automated Visual Prompting Framework and Benchmark" by Hsi-Ai Tsao*, Lei Hsiung*, Pin-Yu Chen, Sijia Liu, and Tsung-…
☆23 · Sep 18, 2025 · Updated 10 months ago
kyrie-23 / linear_task_arithmetic
☆12 · Jul 30, 2025 · Updated 11 months ago
pacman100 / accelerate-deepspeed-test
Testing DeepSpeed integration in 🤗 Accelerate
☆11 · Jun 28, 2022 · Updated 4 years ago
RUCKBReasoning / LLM-Streamline
Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"
☆43 · May 1, 2025 · Updated last year
VainF / In-Video-Instructions
[Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control
☆45 · Nov 25, 2025 · Updated 8 months ago
TUDa-HWAI / Basis_Sharing
☆23 · Updated this week
czg1225 / VeriThinker
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆67 · Sep 27, 2025 · Updated 9 months ago
jiwonsong-dev / SLEB
[ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
☆42 · Feb 4, 2025 · Updated last year
czg1225 / dParallel
[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆65 · Apr 12, 2026 · Updated 3 months ago
lzyhha / HSSL
Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)
☆15 · May 2, 2025 · Updated last year
NVlabs / MaskLLM
[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models
☆189 · Jan 1, 2025 · Updated last year
cvlab-yonsei / ALIFE
An official implementation of "ALIFE: Adaptive Logit Regularizer and Feature Replay for Incremental Semantic Segmentation" (NeurIPS 2022)…
☆49 · Dec 19, 2022 · Updated 3 years ago
LiQiiiii / Awesome-VLA-Safety
[Arxiv] Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms
☆126 · Jul 13, 2026 · Updated last week
HaoHou-98 / SCGAN
Semi-Cycled Generative Adversarial Networks for Real-World Face Super-Resolution
☆28 · Feb 15, 2023 · Updated 3 years ago
IBM / composite-adv
[CVPR23] "Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations" by Lei Hsi…
☆23 · Sep 17, 2025 · Updated 10 months ago