This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value (QKV) weight compression in low-precision Vision-Language Models (VLMs).
☆26May 16, 2026Updated last week
Alternatives and similar repositories for QSVD
Users that are interested in QSVD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Jun 28, 2019Updated 6 years ago
- Vortex: A Flexible and Efficient Sparse Attention Framework☆53May 17, 2026Updated last week
- Standalone software tool for conducting spatial audio listening tests.☆15Nov 29, 2021Updated 4 years ago
- [ICRA 2025] Fast Global Localization on Neural Radiance Field☆17Jun 30, 2025Updated 10 months ago
- TA's implementation for the project of Computer Architecture and Intelligent Chip Design (23 Spring)☆10May 20, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2☆292Aug 28, 2025Updated 8 months ago
- ☆14Feb 5, 2025Updated last year
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆25Nov 13, 2025Updated 6 months ago
- 6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid☆17Aug 31, 2023Updated 2 years ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆78Feb 27, 2026Updated 2 months ago
- [ICML 2026]A framework to compare low-bit integer and float-point formats☆79May 6, 2026Updated 2 weeks ago
- ☆27Apr 28, 2020Updated 6 years ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- GPU accelerated rigid body simulation with OpenGL and OpenCL.☆15Sep 14, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆27Dec 6, 2020Updated 5 years ago
- Research in compressing convolutional layers of CNN using low-rank Tucker tensor decomposition☆11Nov 1, 2023Updated 2 years ago
- Official Implementation of UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3…☆30Jan 13, 2026Updated 4 months ago
- 李宏毅 (Hung-yi Lee) 机器学习 Machine Learning 2023 Spring☆14Dec 25, 2024Updated last year
- LLGS: Illuminating Gaussian Splatting via absorptance Modulation☆20Oct 16, 2024Updated last year
- [ICLR 2025] Palu: Compressing KV-Cache with Low-Rank Projection☆158Feb 20, 2025Updated last year
- ☆12Jun 13, 2025Updated 11 months ago
- matlab code for hyperspectral target/anomaly detection☆10Dec 18, 2020Updated 5 years ago
- Deploy YOLOv8 in Unity using Sentis☆21Apr 20, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Training Transformers with knowledge localization (SGTM)☆51Jan 11, 2026Updated 4 months ago
- Weighted Reed Xiaoli Detector