From-scratch PyTorch implementation of Google's TurboQuant (ICLR 2026) for LLM KV cache compression. 5x compression at 3-bit with 99.5% attention fidelity.
☆436 · Mar 25, 2026 · Updated this week
Alternatives and similar repositories for turboquant-pytorch
Users that are interested in turboquant-pytorch are comparing it to the libraries listed below.
- Kakao Mobility MCP Server for directions and transit information ☆10 · Sep 14, 2025 · Updated 6 months ago
- An MCP server that provides AI assistants with screenshot capabilities: both web page capture via Puppeteer and cross-platform system sc… ☆18 · Mar 9, 2026 · Updated 2 weeks ago
- A simple wrapper to bring Auggie into your development lifecycle. ☆34 · Dec 8, 2025 · Updated 3 months ago
- Ace-Step Dataset Generator ☆23 · Sep 27, 2025 · Updated 6 months ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices. ☆10 · Dec 3, 2024 · Updated last year
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. … ☆37 · Jun 12, 2024 · Updated last year
- Implementation based on lightrag and nano-graphrag to connect with psql ☆15 · Oct 28, 2024 · Updated last year
- A ComfyUI integration to use LMStudio for extending prompts. ☆23 · Apr 30, 2025 · Updated 10 months ago
- This repository provides FlashPortrait custom nodes for ComfyUI. ☆26 · Dec 29, 2025 · Updated 2 months ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun. ☆18 · Dec 19, 2024 · Updated last year
- ☆60 · Mar 16, 2026 · Updated last week
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models. ☆27 · Oct 20, 2025 · Updated 5 months ago
- A library for training crosscoders ☆16 · May 28, 2025 · Updated 10 months ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems ☆35 · Nov 21, 2025 · Updated 4 months ago
- Sound Separation, Omni modal ☆28 · Sep 15, 2025 · Updated 6 months ago
- Use AI to automatically turn any meeting or recording into a structured summary; you can use your favourite LLM APIs or run the entire pr… ☆23 · Jan 20, 2025 · Updated last year
- AWS KR Tech Blog: 'A Practical Guide to Building a Multimodal RAG Chatbot in 30 Minutes with Amazon Bedrock' sample code ☆13 · Feb 15, 2025 · Updated last year
- OAuth Login for Gradio. Supports multiple identity providers. ☆16 · Jan 20, 2025 · Updated last year
- A quick way to get started with Transformer Lens ☆14 · Dec 13, 2023 · Updated 2 years ago
- ☆30 · Aug 25, 2025 · Updated 7 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS) implementation and experiment details. ☆14 · Mar 20, 2024 · Updated 2 years ago
- Versor: Stop Projecting, Start Rotating. GBN (Geometric Blade Network) ☆59 · Updated this week
- ☆25 · Feb 10, 2026 · Updated last month
- jQuery, React and Streamlit applications written by LLMs ☆16 · Dec 24, 2023 · Updated 2 years ago
- Newsdata.io Official Python Client ☆14 · Jan 14, 2026 · Updated 2 months ago
- Slidev implementation ☆19 · Dec 15, 2025 · Updated 3 months ago
- Replacement for AdamW and Lion optimizers for LLMs ☆13 · May 28, 2023 · Updated 2 years ago
- MCP Server for Ghidra. Exposes tools to be used by AI-powered reverse engineers. ☆16 · Mar 29, 2025 · Updated 11 months ago
- This After Effects script helps users build the composition structure for the Twixtor effect over one or more layers with a single click,… ☆13 · Mar 20, 2022 · Updated 4 years ago
- This repository has a tool and an API for Saudi CERT alerts. Its goal is to help improve the level of cybersecurity awareness in Saudi Ar… ☆13 · Nov 16, 2023 · Updated 2 years ago
- Teaching AI to play the classic text adventure Zork using Large Language Models ☆36 · Dec 21, 2025 · Updated 3 months ago
- Python scripts for WIDER FACE Evaluation ☆10 · May 25, 2019 · Updated 6 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆14 · Nov 15, 2025 · Updated 4 months ago
- Low-level Machine Learning Library ☆25 · Feb 1, 2025 · Updated last year
- Official Repository of the Deep Diacritization Paper ☆17 · Dec 16, 2020 · Updated 5 years ago
- ☆17 · Jul 11, 2023 · Updated 2 years ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally ☆19 · Jul 11, 2024 · Updated last year
- A chatbot UI for RAG, multimodal, text completion. (supports Transformers, llama.cpp, MLX, vLLM) ☆20 · Apr 18, 2024 · Updated last year
- Reimagine the implementation of the C-3PO droid voice synthesizer and multilingual translation and communication capabilities with the latest… ☆12 · Mar 6, 2024 · Updated 2 years ago