This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value (QKV) weight compression in low-precision Vision-Language Models (VLMs).
☆28May 16, 2026Updated last month
Alternatives and similar repositories for QSVD
Users that are interested in QSVD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Vortex: Programmable Sparse Attention for Agents as Algorithm Designers☆63Jun 24, 2026Updated last week
- [ICRA 2025] Fast Global Localization on Neural Radiance Field☆17Jun 30, 2025Updated last year
- [ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2☆296Aug 28, 2025Updated 10 months ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆25Nov 13, 2025Updated 7 months ago
- [ICML 2026]A framework to compare low-bit integer and float-point formats☆79May 6, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Research in compressing convolutional layers of CNN using low-rank Tucker tensor decomposition☆12Nov 1, 2023Updated 2 years ago
- Official Implementation of UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3…☆30Jan 13, 2026Updated 5 months ago
- LLGS: Illuminating Gaussian Splatting via absorptance Modulation☆20Oct 16, 2024Updated last year
- ☆13Jun 13, 2025Updated last year
- Training Transformers with knowledge localization (SGTM)☆54Jan 11, 2026Updated 5 months ago
- Activation-aware Singular Value Decomposition for Compressing Large Language Models☆91Oct 22, 2024Updated last year
- [ICLR 2023] PyTorch code for DFPC: Data flow driven pruning of coupled channels without data.☆15Aug 25, 2023Updated 2 years ago
- ☆86May 2, 2026Updated 2 months ago
- ☆15Feb 26, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [TMLR 2026] Is Oracle Pruning the True Oracle?☆26Jun 20, 2026Updated 2 weeks ago
- ☆13May 8, 2023Updated 3 years ago
- [JAG'26] SpatialLLM: From Multi-modality Data to Urban Spatial Intelligence☆120Mar 5, 2026Updated 3 months ago
- Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning☆31Sep 29, 2025Updated 9 months ago
- The Ever-Evolving Science Exam☆52Jan 18, 2026Updated 5 months ago
- ☆11Jun 11, 2025Updated last year
- ☆13Jun 12, 2025Updated last year
- A Unified Framework for Benchmarking Generative Electrocardiogram-Language Models (ELMs)☆48Feb 23, 2026Updated 4 months ago
- An efficient distillation method for flow matching models☆27Feb 1, 2026Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- LiveSecBench:动态中文大模型安全榜单☆28Mar 9, 2026Updated 3 months ago
- Data augmentation using OpenCV☆11Jan 12, 2017Updated 9 years ago
- [ICML 2026] Official Repo for Fast-SAM3D: 3Dfy Anything in Images but Faster☆168Jun 4, 2026Updated last month
- ☆52Mar 31, 2026Updated 3 months ago
- One-shot Global Localization through Semantic Distribution Feature Retrieval and Semantic Topological Histogram Registration☆19Feb 14, 2025Updated last year
- Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.☆18Dec 1, 2023Updated 2 years ago
- A list of papers about point cloud based place recognition, also known as loop closure detection in SLAM (processing)☆10Jan 30, 2024Updated 2 years ago
- [CVPR2019] Fast Online Object Tracking and Segmentation: A Unifying Approach☆11Sep 18, 2020Updated 5 years ago
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆20Mar 31, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 — Carrying out CNN Channel Pruning in a White Box☆18Feb 15, 2022Updated 4 years ago
- Logging in with Scrapy☆14Jan 26, 2018Updated 8 years ago
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps☆13Mar 26, 2025Updated last year
- [SIGGRAPH ASIA 2025] This is the official implementation of the SIGGRAPH ASIA 2025 : Hierarchical Neural Semantic Representation for 3D S…☆19Dec 21, 2025Updated 6 months ago
- ☆17Mar 22, 2024Updated 2 years ago
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆181Sep 19, 2025Updated 9 months ago
- [IEEE TSP 2021] “Robust Subspace Tracking with Missing Data and Outliers: Novel Algorithm with Convergence Guarantee”. IEEE Transactions …☆17Feb 16, 2023Updated 3 years ago