Open-source implementation of Google's TurboQuant (ICLR 2026) — KV cache compression to 2.5–4 bits with near-zero quality loss. 3.8–5.7x memory reduction on Mistral-7B, no training required.
☆46Mar 29, 2026Updated last week
Alternatives and similar repositories for turboquant
Users that are interested in turboquant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Heavy3 Code Audit: Agent skill that uses multi-model consensus to review plans, code, and PRs for coding agents☆45Mar 18, 2026Updated 3 weeks ago
- LLM-powered macOS automation agent. Control Mail, Calendar, Reminders via natural language using AppleScript. Telegram voice commands, br…☆26Mar 31, 2026Updated last week
- ☆29Updated this week
- Podcast/ YouTube video → Transcript!☆42Feb 17, 2026Updated last month
- Open-Source Intelligent Command Layer☆90Updated this week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆11Apr 11, 2020Updated 5 years ago
- A "standard library" of Triton kernels.☆22Oct 2, 2025Updated 6 months ago
- Crawl Google web search result and get text from the url that google give us☆10Sep 17, 2015Updated 10 years ago
- Video production for developers☆38Mar 19, 2026Updated 3 weeks ago
- ☆11Feb 3, 2019Updated 7 years ago
- A bootstrapped framework that uses CodeIgniter, Twitter Bootstrap, Grunt, Bower and various other components to help you start a website.☆32Jul 10, 2015Updated 10 years ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆37Mar 8, 2026Updated last month
- Advances in recent large vision language models (LVLMs)☆15Sep 23, 2024Updated last year
- Release repository for Social Archiver - Archive social media posts from 8 platforms into Obsidian☆95Apr 1, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A lightweight chat interface for interacting with local models, featuring persistent memory using a seamless SQLite database to store you…☆32Sep 15, 2025Updated 6 months ago
- GHUStereo models are novel real-time stereo matching architectures with a low computation complexity characterized by compact cost volum…☆30Dec 14, 2025Updated 3 months ago
- Predicting emotions on Android☆12Nov 26, 2020Updated 5 years ago
- ☆13Feb 27, 2024Updated 2 years ago
- Official implementation of "Attention-aware semantic communications for collaborative inference” (IEEE IoTJ 2024)☆15Jan 22, 2026Updated 2 months ago
- Colorize a lidar pointcloud using synchronized camera images☆14Aug 10, 2024Updated last year
- [JBHI 2024] Self-supervised pre-training on ECG collected in the wild☆15Nov 14, 2023Updated 2 years ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 4 months ago
- Neural ODE Transformers (ICLR 2025)☆18Sep 6, 2025Updated 7 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A Codeigniter helper to generate 'On-The-Fly' image thumbnails.☆12Oct 19, 2012Updated 13 years ago
- An efficent (O(1)) algorithm to extract numbers from a non-uniform discrete probability distribution☆10Jun 19, 2019Updated 6 years ago
- ☆19Sep 13, 2022Updated 3 years ago
- The official source code for "Subgraph Federated Learning for Local Generalization (FedLoG)" at ICLR 2025 (Oral).☆15May 6, 2025Updated 11 months ago
- ☆19Jul 24, 2025Updated 8 months ago
- ☆18Nov 19, 2017Updated 8 years ago
- A MCP (Model Context Protocol) server that provides automated GUI testing and control capabilities through PyAutoGUI.☆42Apr 2, 2025Updated last year
- [ARCHIVED] Contents migrated to monorepo: https://github.com/Kurento/kurento☆14Jan 11, 2023Updated 3 years ago
- A simple application written by Flutter, guid people how to sort waste.☆13Jun 30, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [CVPR 2025] "DepthCues: Evaluating Monocular Depth Perception in Large Vision Models", Duolikun Danier, Mehmet Aygün, Changjian Li, Hakan…☆21Mar 17, 2025Updated last year
- 自己总结的iOS、mac开源项目及库☆15Oct 23, 2015Updated 10 years ago
- ☆23Jan 5, 2025Updated last year
- serial for ros2☆21Mar 27, 2020Updated 6 years ago
- Give Claude/Cursor email powers. 27 MCP tools — inbox, send, reply, contacts, search. Free, no signup.☆599Mar 13, 2026Updated 3 weeks ago
- [TPAMI 2025] Implementation of "Exploring Frequency-Inspired Optimization in Transformer for Efficient Single Image Super-Resolution"☆15Mar 27, 2025Updated last year
- A list of pytorch toolkits for CV application.☆17Jun 20, 2019Updated 6 years ago