KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.
☆373Jun 8, 2026Updated this week
Alternatives and similar repositories for KVarN
Users that are interested in KVarN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- World's first Nintendo 3DS emulator for Apple devices based on Citra.☆18Apr 7, 2023Updated 3 years ago
- Mes projets publics MT5 et Python sont disponibles ici.☆22Jul 9, 2025Updated 10 months ago
- The visual way to organize your ideas & projects☆14Mar 21, 2025Updated last year
- small command line utilitiy to increase work flow of templating☆12Mar 21, 2021Updated 5 years ago
- Files shared in all my home directories☆28Jan 26, 2026Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AI toolkit for professional and amateur oral historians☆13Oct 30, 2023Updated 2 years ago
- Must-know Cryptography concepts for web developers☆20Sep 1, 2025Updated 9 months ago
- ☆12Jun 20, 2016Updated 9 years ago
- ONNX Serving is a project written with C++ to serve onnx-mlir compiled models with GRPC and other protocols.Benefiting from C++ implement…☆25Sep 17, 2025Updated 8 months ago
- ☆14May 27, 2026Updated last week
- A sunpy plugin for accessing data in the Solar Orbiter Archive (SOAR).☆21Updated this week
- Examples: jsonnet as a config language for Apache Aurora☆11Jan 29, 2016Updated 10 years ago
- Compact and Agent-Native MoE Training System☆144Updated this week
- ☆20May 18, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for pre-training BabyLM baseline models.☆16Jun 19, 2023Updated 2 years ago
- OpenTelemetry Service☆16Jan 14, 2026Updated 4 months ago
- TL2cgen (TreeLite 2 C GENerator) is a model compiler for decision tree models☆48May 25, 2026Updated 2 weeks ago
- A mechanism for doing incremental deploys with Bazel☆16Updated this week
- ☆11Mar 30, 2025Updated last year
- ☆10Jul 12, 2017Updated 8 years ago
- s(4)u for Windows☆49Dec 8, 2020Updated 5 years ago
- 🐍 Copy passwords from Crome and send them via email,Python && win32crypt☆41Apr 15, 2016Updated 10 years ago
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Configuration flags for libraries and applications.☆12Oct 11, 2017Updated 8 years ago
- A SapientML plugin of SapientMLGenerator☆11Apr 6, 2026Updated 2 months ago
- GGUF parser in Python☆29May 1, 2026Updated last month
- Phabricator webhook for Prometheus Alertmanager☆17Jan 9, 2023Updated 3 years ago
- An LLM inference engine, written in C++☆20Mar 30, 2026Updated 2 months ago
- A simple C Thread pool implementation☆13Apr 10, 2020Updated 6 years ago
- Packer plugin used to generate SSH keys.☆24Dec 12, 2024Updated last year
- HMAC support for Lua with multiple algorithms, via OpenSSL and FFI☆23May 8, 2018Updated 8 years ago
- A dynamic GPU memory allocator, suitable for warp synchronized scenarios.☆11Aug 20, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ocr照片识别文字,包括裁剪图片,能识别中文和英文,是现有网上资源中识别率最好的☆14Sep 20, 2016Updated 9 years ago
- a ros node using face_net do face_recognition☆12Jul 27, 2016Updated 9 years ago
- ☆14Aug 25, 2023Updated 2 years ago
- markdown-it plugin for modifying tokens in a markdown document. It can for example modify content or attributes for certain type of eleme…☆24Jun 13, 2017Updated 8 years ago
- ☆41Jan 26, 2022Updated 4 years ago
- AI-ML-NLP Task Group☆13Aug 10, 2023Updated 2 years ago
- 本书是《5G Mobile Networks : A Systems Approach》(https://5g.systemsapproach.org/)的中文版翻译。☆13Jun 26, 2022Updated 3 years ago