KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.
☆420Jun 22, 2026Updated last week
Alternatives and similar repositories for KVarN
Users that are interested in KVarN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- World's first Nintendo 3DS emulator for Apple devices based on Citra.☆18Apr 7, 2023Updated 3 years ago
- Mes projets publics MT5 et Python sont disponibles ici.☆22Jul 9, 2025Updated 11 months ago
- The visual way to organize your ideas & projects☆14Mar 21, 2025Updated last year
- small command line utilitiy to increase work flow of templating☆12Mar 21, 2021Updated 5 years ago
- A planetary geared FDM filament extruder developed to work with the BLV Cube designed by Ben Levi. The design has two branches, one that…☆17Mar 7, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆15Mar 15, 2026Updated 3 months ago
- Files shared in all my home directories☆27Jan 26, 2026Updated 5 months ago
- AI toolkit for professional and amateur oral historians☆13Oct 30, 2023Updated 2 years ago
- [READ ONLY] Subtree split of the SocialiteProviders/Twitter Provider (see SocialiteProviders/Providers)☆24Feb 21, 2026Updated 4 months ago
- restores a 50/50 split layout and starts 2 urxvt terminals when a new workspace is created☆13Mar 29, 2020Updated 6 years ago
- Must-know Cryptography concepts for web developers☆20Sep 1, 2025Updated 9 months ago
- ☆12Jun 20, 2016Updated 10 years ago
- Guacamole auth plugin with pasword less psk authentication. With config file per user.☆10Jun 14, 2023Updated 3 years ago
- Kubernetes RBAC authorizing HTTP proxy for a single upstream.☆12Apr 15, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ONNX Serving is a project written with C++ to serve onnx-mlir compiled models with GRPC and other protocols.Benefiting from C++ implement…☆25Sep 17, 2025Updated 9 months ago
- Self-hosted web service and application to test internet speed of a remote server/computer via Speedtest-CLI☆12Feb 23, 2026Updated 4 months ago
- ☆14May 27, 2026Updated last month
- A sunpy plugin for accessing data in the Solar Orbiter Archive (SOAR).☆21Jun 3, 2026Updated 3 weeks ago
- Examples: jsonnet as a config language for Apache Aurora☆11Jan 29, 2016Updated 10 years ago
- Clear Linux guest boxes for Vagrant☆11Feb 3, 2022Updated 4 years ago
- ☆20May 18, 2026Updated last month
- Code for pre-training BabyLM baseline models.☆16Jun 19, 2023Updated 3 years ago
- OpenTelemetry Service☆16Jan 14, 2026Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Deploys a Dockerized Rails app to Kubernetes on Google, using GitHub Actions and Pulumi☆16Aug 19, 2023Updated 2 years ago
- Deploy Rocket.Chat on your Kubernetes cluster with ease.☆15Oct 9, 2018Updated 7 years ago
- TL2cgen (TreeLite 2 C GENerator) is a model compiler for decision tree models☆50May 25, 2026Updated last month
- A mechanism for doing incremental deploys with Bazel☆16Updated this week
- Python support for Keycloak☆14Apr 20, 2021Updated 5 years ago
- ☆42May 13, 2026Updated last month
- ☆11Mar 30, 2025Updated last year
- ☆10Jul 12, 2017Updated 8 years ago
- s(4)u for Windows☆49Dec 8, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Compact and Agent-Native MoE Training System☆209Updated this week
- 🐍 Copy passwords from Crome and send them via email,Python && win32crypt☆41Apr 15, 2016Updated 10 years ago
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- Configuration flags for libraries and applications.☆12Oct 11, 2017Updated 8 years ago
- 2FA NGINX + Lua auth portal☆15Jan 16, 2018Updated 8 years ago
- A SapientML plugin of SapientMLGenerator☆11Apr 6, 2026Updated 2 months ago
- GGUF parser in Python☆29May 1, 2026Updated last month