RotorQuant: Clifford algebra vector quantization for LLM KV cache compression. 10-19x faster than TurboQuant, 44x fewer parameters.
☆161Mar 29, 2026Updated this week
Alternatives and similar repositories for rotorquant
Users that are interested in rotorquant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆34Mar 19, 2026Updated last week
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings☆117Jul 27, 2025Updated 8 months ago
- A web interface for SleekDB written in PHP☆11Jan 22, 2022Updated 4 years ago
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial)☆11Jul 19, 2024Updated last year
- My Gen AI research☆11Jun 3, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A simple website to manage your Hyper-V VMs and IIS sites☆12Jan 19, 2023Updated 3 years ago
- ☆14Sep 18, 2024Updated last year
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- A vector tile design information processing tool☆17Apr 22, 2025Updated 11 months ago
- Example of Langchain-Elasticsearch integrations & RAG.☆12Sep 20, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Personnal collection of pipes and filters I use for open-webui☆26Mar 10, 2026Updated 2 weeks ago
- A modified version of searx (the privacy-respecting metasearch engine) to only search an allowlist of sites, to build functionality simil…☆19Sep 17, 2021Updated 4 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆21Jun 4, 2024Updated last year
- GGUF Quantization of any LLM.☆42Mar 4, 2024Updated 2 years ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆52Feb 10, 2026Updated last month
- Run NASA's General Mission Analysis Tool (GMAT) from Julia☆10Sep 3, 2020Updated 5 years ago
- Plugin for Open WebUI☆37Mar 12, 2026Updated 2 weeks ago
- a simple command line tool / package that prints the dependencies of a python project☆28Apr 6, 2018Updated 7 years ago
- ☆28Apr 17, 2025Updated 11 months ago
- Optional argument checks allow you to omit them when performance is critical.☆10Aug 31, 2022Updated 3 years ago
- Proxy server to Argo API, OpenAI format compatible☆20Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Object Detection with Transformers : DETR, Conditional DETR, Deformable DETR, Dynamic Head☆12Jan 22, 2023Updated 3 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 9 months ago
- A collection of example workflows for GitHub Actions that are used for Julia projects and packages.☆13Apr 28, 2020Updated 5 years ago
- furter development of the prairie openid server originally developed by barnraiser.com☆33Jun 24, 2016Updated 9 years ago
- Function calls with 50% less typing ;-)☆12Jul 29, 2021Updated 4 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 5 months ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆29Feb 13, 2026Updated last month
- tinyrobotics - A lightweight, fast and versatile C++ library for robotics.☆11Nov 22, 2023Updated 2 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- `@code_costs`: a variant of `@code_typed` with estimated costs☆13Sep 1, 2020Updated 5 years ago
- Setup an MCP server in 60 seconds.☆13Dec 12, 2024Updated last year
- ChatPDF is a Streamlit app allowing users to query PDF & DOCX content via natural language. It indexes documents for conversational inter…☆17Sep 5, 2023Updated 2 years ago
- c++的一些基础知识总结☆10Oct 28, 2020Updated 5 years ago
- Execute npm scripts on one click in atom☆10Jun 20, 2017Updated 8 years ago
- Forced alignment decoder for Whisper.☆15Mar 13, 2024Updated 2 years ago
- ESM front-end to 7-Zip, featuring alternative 7z CLI tool, binaries for Linux, Windows, Mac OSX, and seamlessly create 7zip SFX self extr…☆13Feb 14, 2026Updated last month