zimingyy / SubZeroLinks
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces (ICCV 2025)
☆15Updated last year
Alternatives and similar repositories for SubZero
Users that are interested in SubZero are comparing it to the libraries listed below
Sorting:
- Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer☆23Updated 11 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆70Updated last year
- A library for calculating the FLOPs in the forward() process based on torch.fx☆137Updated last month
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'☆15Updated 9 months ago
- 😎 A curated list of tensor decomposition resources for model compression.☆104Updated 3 weeks ago
- Awesome-Low-Rank-Adaptation☆128Updated last year
- ☆36Updated 3 years ago
- This repository contains low-bit quantization papers from 2020 to 2025 on top conference.☆95Updated 4 months ago
- YAECL: Yet Another Entropy Coding Library for Neural Compression Research, with Arithmetic Coding and Asymmetric Numeral Systems support☆39Updated 2 years ago
- Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.☆138Updated 3 years ago
- Official code implementation for 2025 ICLR accepted paper "Dobi-SVD : Differentiable SVD for LLM Compression and Some New Perspectives"☆50Updated 3 months ago
- ☆20Updated last year
- Python logging package for easy reproducible experimenting in research☆40Updated 6 months ago
- ☆63Updated last year
- An Numpy and PyTorch Implementation of CKA-similarity with CUDA support☆94Updated 4 years ago
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆80Updated last year
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)☆141Updated 2 years ago
- The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink…☆834Updated last month
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆233Updated last year
- Survey Paper List - Efficient LLM and Foundation Models☆260Updated last year
- Code for ICCV23 paper "Automatic network pruning via Hilbert Schmidt independence criterion lasso under information bottleneck principle"☆18Updated 2 years ago
- DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing (WACV 2025)☆12Updated this week
- PyTorch code for our paper "Resource-Adaptive Federated Learning with All-In-One Neural Composition" (NeurIPS2022)☆19Updated 3 years ago
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆127Updated 4 months ago
- [TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".☆482Updated 6 months ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆293Updated last year
- Neural Distributed Image Compression using Common Information (NDIC) [DCC 2022]☆35Updated 3 years ago
- ☆56Updated last year
- [NeurIPS 2023] Structural Pruning for Diffusion Models☆216Updated last year
- Curated list of methods that focuses on improving the efficiency of diffusion models☆44Updated last year