[ICLR 2025] Dobi-SVD : Differentiable SVD for LLM Compression and Some New Perspectives"
β53Oct 19, 2025Updated 7 months ago
Alternatives and similar repositories for Dobi-SVD
Users that are interested in Dobi-SVD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systemsβ16Nov 1, 2025Updated 6 months ago
- [ICML 2025] Official Repo for Stability-guided Adaptive Diffusion Acceleration. ππAccelerating off-the-shelf diffusion model with a uniβ¦β43Jul 24, 2025Updated 10 months ago
- β16Nov 5, 2025Updated 6 months ago
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'β18Apr 24, 2025Updated last year
- Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs"β12Jun 11, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [COLM 2025] DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation; η₯δΉοΌhttps://zhuanlan.zhihu.cβ¦β30Mar 5, 2025Updated last year
- Activation-aware Singular Value Decomposition for Compressing Large Language Modelsβ92Oct 22, 2024Updated last year
- Artifact for "Marconi: Prefix Caching for the Era of Hybrid LLMs" [MLSys '25 Outstanding Paper Award, Honorable Mention]β58Mar 5, 2025Updated last year
- [ACL 2023] TeAST: Temporal Knowledge Graph Embedding via Archimedean Spiral Timelineβ12Mar 4, 2024Updated 2 years ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2β¦β14Dec 7, 2024Updated last year
- [ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Modelsβ62Aug 9, 2024Updated last year
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMsβ45Jun 17, 2025Updated 11 months ago
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".β25Mar 16, 2025Updated last year
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units.β25Dec 10, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β25Apr 13, 2025Updated last year
- Causal Inference-based Debiasing Framework for Knowledge Graph Completionβ13Mar 19, 2024Updated 2 years ago
- A small library for doing fluid simulation with neural networks.β23Dec 13, 2021Updated 4 years ago
- β10Jan 28, 2024Updated 2 years ago
- Multi-dimensional analysis of orthogonal safety directions in LLM alignmentβ22Mar 20, 2025Updated last year
- β48May 9, 2026Updated 3 weeks ago
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attentionβ53Aug 6, 2025Updated 9 months ago
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value β¦β26May 16, 2026Updated 2 weeks ago
- β36Mar 12, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β11May 19, 2025Updated last year
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategyβ73Jan 22, 2025Updated last year
- Elucidated Dataset Condensation (NeurIPS 2024)β20Oct 5, 2024Updated last year
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"β23Sep 1, 2025Updated 8 months ago
- β78Jun 28, 2025Updated 11 months ago
- β11Apr 5, 2023Updated 3 years ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Timeβ89Jun 10, 2025Updated 11 months ago
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"β13Jun 7, 2023Updated 2 years ago
- Torch2Chip (MLSys, 2024)β56Apr 2, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- β13Jan 7, 2025Updated last year
- β11Sep 16, 2023Updated 2 years ago
- β12Dec 26, 2024Updated last year
- A repo of a modified version of Diffusion Transformerβ51Sep 14, 2025Updated 8 months ago
- β14Jan 15, 2026Updated 4 months ago
- This is the Pytorch implementation of paper--Training deep neural-networks using a noise adaptation layer.β10Apr 18, 2021Updated 5 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inferenceβ10Dec 15, 2024Updated last year