Proof of concept: exploiting temporal coherence in LLM inference through delta encoding for KV cache compression and weight-skip prediction. Achieves F16-quality KV cache at Q4_0 compression ratios with zero perplexity loss on llama.cpp.
☆43 · Apr 10, 2026 · Updated this week
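As a rough illustration of the delta-encoding idea named in the description, the sketch below stores the first vector exactly and every later vector as a 4-bit quantized difference from the previously reconstructed one. It is not taken from the repository: the function names, the per-vector scale, the symmetric -7..7 code range, and the synthetic sine-wave "key vectors" are all assumptions made for the example, and the real project may work quite differently.

```python
import math

QMAX = 7  # largest magnitude of a signed 4-bit code (assumed symmetric range -7..7)

def quantize(delta):
    """Quantize one delta vector to 4-bit signed codes with a per-vector scale."""
    scale = max(abs(d) for d in delta) / QMAX or 1.0  # fall back to 1.0 for an all-zero delta
    codes = [max(-QMAX, min(QMAX, round(d / scale))) for d in delta]
    return scale, codes

def encode(vectors):
    """Keep the first vector exactly; store each later one as a quantized delta."""
    base = list(vectors[0])
    prev, deltas = base, []
    for vec in vectors[1:]:
        # Take the delta against the *reconstructed* previous vector so that
        # quantization error does not accumulate from token to token.
        scale, codes = quantize([v - p for v, p in zip(vec, prev)])
        prev = [p + scale * c for p, c in zip(prev, codes)]
        deltas.append((scale, codes))
    return base, deltas

def decode(base, deltas):
    """Rebuild every vector by replaying the deltas on top of the base."""
    out = [list(base)]
    for scale, codes in deltas:
        out.append([p + scale * c for p, c in zip(out[-1], codes)])
    return out

# Hypothetical slowly varying vectors standing in for consecutive KV cache entries.
vectors = [[math.sin(0.01 * t + i) for i in range(8)] for t in range(50)]
base, deltas = encode(vectors)
decoded = decode(base, deltas)
max_err = max(abs(a - b) for v, d in zip(vectors, decoded) for a, b in zip(v, d))
```

Because consecutive vectors in this toy data differ only slightly, the deltas have a small range and the 4-bit codes lose little precision. Whether real KV cache entries show comparable coherence is the claim the proof of concept sets out to test.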
Alternatives and similar repositories for delta-compress-llm
Users who are interested in delta-compress-llm are comparing it to the libraries listed below.
- Multi-speaker separation, identification, diarization ALL-IN-ONE. It can isolate the target speaker from a conversation audio and do ASR. ☆70 · Oct 13, 2025 · Updated 6 months ago
- Based on the implementation of Google's TurboQuant (ICLR 2026), Quansloth brings elite KV cache compression to local LLM inference. Qua… ☆76 · Updated this week
- A one-click Tampermonkey script for jumping from GitHub to CodeWiki & DeepWiki & Zrea… ☆28 · Mar 25, 2026 · Updated 3 weeks ago
- Open Source Routing Machine for OpenStreetMap API Lib and App for Nim ☆10 · Jun 6, 2019 · Updated 6 years ago
- ☆12 · Nov 23, 2020 · Updated 5 years ago
- My attempt to implement Metro theme for jQuery Mobile ☆241 · Aug 13, 2012 · Updated 13 years ago
- It's ooey and gooey. No clue what this really is aside from a GUI framework basis? ☆14 · Oct 1, 2024 · Updated last year
- An early warning device for the elderly when they fall ☆15 · Apr 6, 2026 · Updated last week
- ISAAC PRNG implementation on Nim ☆10 · Nov 16, 2017 · Updated 8 years ago
- HTTP spellcheck API powered by hunspell ☆26 · Nov 20, 2014 · Updated 11 years ago
- GitHub Profile Frontpage ☆11 · May 30, 2024 · Updated last year
- A simple demonstration of a GeoDjango application. ☆52 · Feb 28, 2011 · Updated 15 years ago
- Play any game ROM with associated emulator in RetroArch on Linux ☆17 · Jan 10, 2025 · Updated last year
- ☆11 · Jun 30, 2022 · Updated 3 years ago
- Tunee is your AI Music Partner. Create songs by chatting, manage multiple projects visually, and turn your music into stunning cinematic … ☆39 · Sep 17, 2025 · Updated 6 months ago
- A Zepto Plugin for iOS like swipe navigation. ☆87 · Oct 1, 2013 · Updated 12 years ago
- ☆21 · Jan 24, 2019 · Updated 7 years ago
- LaTeX template of NCKU Thesis ☆11 · Nov 24, 2014 · Updated 11 years ago
- ☆10 · Nov 20, 2021 · Updated 4 years ago
- Awaitable threadpool for nim ☆53 · May 16, 2025 · Updated 10 months ago
- Resolves UDP packets straight from TWSE with Linux C/C++ on a software level. ☆83 · Feb 16, 2026 · Updated last month
- MCP server for generating draw.io diagrams using mxgraph. Create and manage diagrams programmatically or with AI and VSCode extension. ☆30 · Sep 22, 2025 · Updated 6 months ago
- Asynchronous downloads manager library for Nim. ☆12 · Jan 12, 2024 · Updated 2 years ago
- terminal ui widget based on illwill ☆19 · Sep 9, 2025 · Updated 7 months ago
- Extracting text from the 10-K filings on SEC's EDGAR ☆10 · May 25, 2014 · Updated 11 years ago
- High-speed rail ticket-booking CAPTCHA, using Python, Keras (TensorFlow), and a CNN ☆39 · Sep 13, 2022 · Updated 3 years ago
- SMS Gateway for GoIP ☆12 · May 14, 2022 · Updated 3 years ago
- A curated list of my GitHub stars! ☆15 · Updated this week
- 🚀 Generate executable TypeScript tools from MCP servers with 98% token savings. Progressive loading pattern for AI agents. Production-re… ☆38 · Updated this week
- A Link in Bio website configured with only a JSON file. ☆17 · Feb 4, 2024 · Updated 2 years ago
- Synchronises Ableton Live to a live input such as microphone in the room or DJ playing record ☆11 · Aug 14, 2014 · Updated 11 years ago
- A small PHP program to send SMS messages to a GOIP GSM Gateway using a simple restful API ☆12 · Dec 28, 2024 · Updated last year
- PTS test suite based on SNIA's Solid State Storage Performance Test Specification ☆20 · Sep 12, 2018 · Updated 7 years ago
- SineKAN: Kolmogorov-Arnold Networks Using Sinusoidal Activation Functions ☆15 · Dec 19, 2024 · Updated last year
- Python class for distributed video processing and encoding ☆16 · Sep 4, 2021 · Updated 4 years ago
- Browser extension for account abstraction. ☆10 · Sep 3, 2024 · Updated last year
- Artifact of paper "Exploiting Recent SIMD Architectural Advances for Irregular Applications" ☆11 · Jun 23, 2016 · Updated 9 years ago
- Random Serum Patches ☆17 · Apr 21, 2018 · Updated 7 years ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale … ☆18 · Aug 17, 2025 · Updated 7 months ago