Open-source implementation of Google's TurboQuant (ICLR 2026) — KV cache compression to 2.5–4 bits with near-zero quality loss. 3.8–5.7x memory reduction on Mistral-7B, no training required.
☆50Mar 29, 2026Updated last month
Alternatives and similar repositories for turboquant
Users that are interested in turboquant are comparing it to the libraries listed below.
- Essential information for the 8-bit part of ECE-231☆24May 3, 2025Updated 11 months ago
- ☆12Jun 10, 2024Updated last year
- Transmitter and receiver sketches for Arduino for remote servo PWM control, e.g. for an electronic skateboard or RC car☆19Oct 9, 2015Updated 10 years ago
- After flashing, installs the minimum development environment☆24Sep 4, 2025Updated 7 months ago
- Heavy3 Code Audit: Agent skill that uses multi-model consensus to review plans, code, and PRs for coding agents☆45Apr 18, 2026Updated last week
- LLM-powered macOS automation agent. Control Mail, Calendar, Reminders via natural language using AppleScript. Telegram voice commands, br…☆26Mar 31, 2026Updated 3 weeks ago
- Bypasses VMProtect's VMware & VMware Tools detection through user-mode API hooks.☆26Aug 3, 2024Updated last year
- ☆22Aug 23, 2022Updated 3 years ago
- ☆36Updated this week
- Podcast/YouTube video → Transcript!☆41Feb 17, 2026Updated 2 months ago
- Open-Source Intelligent Command Layer☆92Apr 16, 2026Updated last week
- ☆11Apr 11, 2020Updated 6 years ago
- Crawl Google web search results and extract the text from the URLs that Google returns☆10Sep 17, 2015Updated 10 years ago
- A "standard library" of Triton kernels.☆22Oct 2, 2025Updated 6 months ago
- Tool for working with PS4 .pkgs☆20Oct 13, 2023Updated 2 years ago
- Video production for developers☆39Mar 19, 2026Updated last month
- ☆11Feb 3, 2019Updated 7 years ago
- A bootstrapped framework that uses CodeIgniter, Twitter Bootstrap, Grunt, Bower and various other components to help you start a website.☆32Jul 10, 2015Updated 10 years ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆39Mar 8, 2026Updated last month
- ☆33Oct 22, 2022Updated 3 years ago
- Advances in recent large vision language models (LVLMs)☆15Sep 23, 2024Updated last year
- A lightweight chat interface for interacting with local models, featuring persistent memory using a seamless SQLite database to store you…☆34Sep 15, 2025Updated 7 months ago
- GHUStereo models are novel real-time stereo matching architectures with a low computation complexity characterized by compact cost volum…☆31Dec 14, 2025Updated 4 months ago
- Predicting emotions on Android☆12Nov 26, 2020Updated 5 years ago
- ☆13Feb 27, 2024Updated 2 years ago
- Release repository for Social Archiver - Archive social media posts from 8 platforms into Obsidian☆104Updated this week
- Official implementation of "Attention-aware semantic communications for collaborative inference" (IEEE IoTJ 2024)☆15Jan 22, 2026Updated 3 months ago
- Colorize a lidar pointcloud using synchronized camera images☆14Aug 10, 2024Updated last year
- [JBHI 2024] Self-supervised pre-training on ECG collected in the wild☆15Nov 14, 2023Updated 2 years ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 4 months ago
- Neural ODE Transformers (ICLR 2025)☆18Sep 6, 2025Updated 7 months ago
- A Codeigniter helper to generate 'On-The-Fly' image thumbnails.☆12Oct 19, 2012Updated 13 years ago
- An efficient (O(1)) algorithm to extract numbers from a non-uniform discrete probability distribution☆10Jun 19, 2019Updated 6 years ago
- ☆55Jul 1, 2025Updated 9 months ago
- ☆19Sep 13, 2022Updated 3 years ago
- A robust shell script for automated backup and restore of n8n workflows, credentials, and environment variables to GitHub. Supports inter…☆59Sep 25, 2025Updated 7 months ago
- The official source code for "Subgraph Federated Learning for Local Generalization (FedLoG)" at ICLR 2025 (Oral).☆15May 6, 2025Updated 11 months ago
- ☆19Jul 24, 2025Updated 9 months ago
- ☆18Nov 19, 2017Updated 8 years ago