cenconq25 / delta-compress-llm
Proof of concept: exploiting temporal coherence in LLM inference via delta encoding for KV cache compression and weight-skip prediction. Achieves F16-quality KV cache at Q4_0 compression ratios with zero perplexity loss on llama.cpp.
39 stars · Mar 24, 2026 · Updated this week
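The description points at a simple core idea: consecutive tokens tend to produce KV cache rows that change only slightly, so storing a full-precision base row plus coarsely quantized deltas can retain near-F16 fidelity at a Q4_0-like footprint. The C++ snippet below is a minimal sketch of that idea under stated assumptions; `DeltaBlock`, `encode_delta`, and `decode_delta` are hypothetical names for illustration and are not taken from the repository, whose actual layout, scaling, and bit packing may differ.

```cpp
// Minimal sketch (illustrative, not the repo's actual code): delta-encode one
// KV cache row against the previous token's row, quantize the deltas to a
// 4-bit signed range, then reconstruct. Temporal coherence means consecutive
// rows are similar, so the deltas are small and survive coarse quantization.
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <cstdio>
#include <vector>

// One encoded row: a per-row scale plus 4-bit signed codes
// (stored one per byte here purely for readability).
struct DeltaBlock {
    float scale;
    std::vector<int8_t> q;
};

DeltaBlock encode_delta(const std::vector<float>& prev,
                        const std::vector<float>& cur) {
    DeltaBlock blk;
    blk.q.resize(cur.size());
    // Per-row scale chosen so the largest delta maps to the 4-bit signed range [-7, 7].
    float max_abs = 0.0f;
    for (size_t i = 0; i < cur.size(); ++i)
        max_abs = std::max(max_abs, std::fabs(cur[i] - prev[i]));
    blk.scale = max_abs > 0.0f ? max_abs / 7.0f : 1.0f;
    for (size_t i = 0; i < cur.size(); ++i) {
        float d = (cur[i] - prev[i]) / blk.scale;
        blk.q[i] = (int8_t)std::clamp((int)std::lround(d), -7, 7);
    }
    return blk;
}

std::vector<float> decode_delta(const std::vector<float>& prev,
                                const DeltaBlock& blk) {
    std::vector<float> out(prev.size());
    for (size_t i = 0; i < prev.size(); ++i)
        out[i] = prev[i] + blk.q[i] * blk.scale;
    return out;
}

int main() {
    // Two consecutive key vectors that differ only slightly (temporal coherence).
    std::vector<float> k_prev = {0.80f, -1.20f, 0.05f, 0.42f};
    std::vector<float> k_cur  = {0.82f, -1.18f, 0.04f, 0.45f};
    DeltaBlock blk = encode_delta(k_prev, k_cur);
    std::vector<float> k_rec = decode_delta(k_prev, blk);
    for (size_t i = 0; i < k_cur.size(); ++i)
        std::printf("orig %+.3f  reconstructed %+.3f\n", k_cur[i], k_rec[i]);
    return 0;
}
```

In a real implementation the 4-bit codes would be packed two per byte and the scale stored per block rather than per row; the sketch keeps one code per byte only to make the encode/decode round trip easy to follow.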

Alternatives and similar repositories for delta-compress-llm

Users interested in delta-compress-llm are comparing it to the libraries listed below.
