wangqinsi1/Dobi-SVD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wangqinsi1/Dobi-SVD)

wangqinsi1 / Dobi-SVD

[ICLR 2025] Dobi-SVD : Differentiable SVD for LLM Compression and Some New Perspectives"

☆54

Alternatives and similar repositories for Dobi-SVD

Users that are interested in Dobi-SVD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wangqinsi1 / 2025-ICML-CoreMatching
View on GitHub
[ICML 2025] CoreMatching: Co-adaptive Sparse Inference Framework for Comprehensive Acceleration of Vision Language Model
☆16May 27, 2025Updated last year
ZHITENGLI / AdaSVD
View on GitHub
PyTorch code for our paper "AdaSVD: Adaptive Singular Value Decomposition for Large Language Models"
☆15Mar 9, 2025Updated last year
T2S-Bench / T2S-Bench
View on GitHub
This is Official implementation for T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasonin…
☆24Mar 5, 2026Updated 4 months ago
wangqinsi1 / CoreInfer
View on GitHub
This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…
☆18Oct 25, 2024Updated last year
Zishan-Shao / FlashSVD
View on GitHub
[AAAI 2026] Official implementation of "FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models". If you find this reposi…
☆17May 1, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HankYe / KVCOMM
View on GitHub
[NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems
☆17Nov 1, 2025Updated 8 months ago
wangqinsi1 / GAINRL
View on GitHub
[NeurIPS Spotlight 2025] Angles Don’t Lie: Unlocking Training-Efficient RL Through the Model’s Own Signals.
☆83Sep 26, 2025Updated 10 months ago
Ting-Justin-Jiang / ZEUS
View on GitHub
[ACM MM 2026]⚡ZEUS accelerates your diffuser. Any modality. Any model. Any scheduler. https://yixiao-wang-stats.github.io/zeus/
☆20Jun 2, 2026Updated last month
SAI-Lab-NYU / QSVD
View on GitHub
This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …
☆28May 16, 2026Updated 2 months ago
Ting-Justin-Jiang / sada-icml
View on GitHub
[ICML 2025] Official Repo for Stability-guided Adaptive Diffusion Acceleration. 🚀🌙Accelerating off-the-shelf diffusion model with a uni…
☆43Jul 24, 2025Updated last year
Ah-miu / Dobi-SVD.page
View on GitHub
"Knock, knock!" "Who's there?" "Dobi."
☆17Aug 11, 2025Updated 11 months ago
wangqinsi1 / Vision-Zero
View on GitHub
[ICLR 2026] Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.
☆136Feb 6, 2026Updated 5 months ago
thu-nics / MBQ
View on GitHub
The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"
☆93Mar 17, 2025Updated last year
TUDa-HWAI / Basis_Sharing
View on GitHub
☆23Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Ah-miu / Ah-miu.github.io
View on GitHub
☆18Jul 7, 2026Updated 3 weeks ago
hahnyuan / ASVD4LLM
View on GitHub
Activation-aware Singular Value Decomposition for Compressing Large Language Models
☆92Oct 22, 2024Updated last year
JingyangXiang / DFRot
View on GitHub
[COLM 2025] DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation; 知乎：https://zhuanlan.zhihu.c…
☆30Mar 5, 2025Updated last year
Intelligent-Computing-Lab-Panda / GPTAQ
View on GitHub
Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)
☆93Jul 28, 2025Updated last year
Yuzhe-Fu / FractalCloud
View on GitHub
[HPCA 2026] FractalCloud: A Fractal-Inspired Architecture for Efficient Large-Scale Point Cloud Processing
☆22Apr 21, 2026Updated 3 months ago
DerrickYLJ / TidalDecode
View on GitHub
[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
☆57Aug 6, 2025Updated 11 months ago
BrotherHappy / OSTQuant
View on GitHub
[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt…
☆94Apr 8, 2025Updated last year
CASIA-LMC-Lab / FLAP
View on GitHub
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models
☆76Jan 6, 2024Updated 2 years ago
Xingyu-Zheng / FOEM
View on GitHub
(AAAI 2026) First-Order Error Matters: Accurate Compensation for Quantized Large Language Models
☆16Apr 16, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HuangOwen / Awesome-LLM-Compression
View on GitHub
Awesome LLM compression research papers and tools.
☆1,855Jun 30, 2026Updated 3 weeks ago
SJTU-Storage-Lab / CacheSlide
View on GitHub
☆35Jan 27, 2026Updated 6 months ago
zyxxmu / DSnoT
View on GitHub
Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…
☆51Apr 9, 2024Updated 2 years ago
seanscott1991 / Duke_SQL4DQA
View on GitHub
☆16Nov 5, 2025Updated 8 months ago
ruikangliu / FlatQuant
View on GitHub
[ICML 2025] Official PyTorch implementation of "FlatQuant: Flatness Matters for LLM Quantization"
☆223Nov 25, 2025Updated 8 months ago
Odysseusq / VLCache
View on GitHub
Official Repo for paper "VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference"
☆16Mar 28, 2026Updated 4 months ago
adreamwu / PTQ4DiT
View on GitHub
PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005
☆49Nov 8, 2024Updated last year
dnhkng / PCAonGPU
View on GitHub
A GPU-based Incremental PCA implementation.
☆32Feb 18, 2025Updated last year
StiphyJay / MQuant
View on GitHub
[ACM MM2025]: MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization
☆44Aug 13, 2025Updated 11 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
dellixx / TeAST
View on GitHub
[ACL 2023] TeAST: Temporal Knowledge Graph Embedding via Archimedean Spiral Timeline
☆12Mar 4, 2024Updated 2 years ago
parsa-epfl / quantization-sparsity-interplay
View on GitHub
This repo contains the code for studying the interplay between quantization and sparsity methods
☆26Feb 26, 2025Updated last year
colehawkins / bayesian-tensor-rank-determination
View on GitHub
☆13Dec 17, 2021Updated 4 years ago
alibaba / EfficientAI
View on GitHub
☆48May 9, 2026Updated 2 months ago
mohmdelsayed / HesScale
View on GitHub
Scalable Computation of Hessian Diagonals
☆14Jun 2, 2024Updated 2 years ago
ELM-Research / ECG-Language-Models
View on GitHub
A research-oriented training and evaluation framework for ECG-Language Models (ELMs)
☆16Updated this week
stephenqz / OATS
View on GitHub
Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition
☆20Apr 16, 2025Updated last year