llama.cpp fork with TQ3_1S/4S CUDA kernels — 3.5-bit WHT quantization achieving Q4s quality at 10% smaller size. Based on RaBitQ-inspired Walsh-Hadamard transform. Enables 27B models on 16GB GPUs with 15 tok/s TG, 221 tok/s PP.
☆188May 19, 2026Updated last week
Alternatives and similar repositories for llama.cpp-tq3
Users that are interested in llama.cpp-tq3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems☆241May 19, 2026Updated last week
- A collection of various technical indicators implemented in LitScript☆10Apr 5, 2022Updated 4 years ago
- TradingLite - LitScript Documentation☆13Jan 31, 2020Updated 6 years ago
- ☆30Dec 7, 2025Updated 5 months ago
- Title database for Ninty Launcher.☆11Feb 20, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A lightweight BitTorrent client built with C++, Qt 6, and libtorrent-rasterbar.☆42May 19, 2026Updated last week
- Coding Introduction / Tutorial, mostly by writing small little fun games☆24Jun 25, 2025Updated 11 months ago
- raylib + LuaJIT + Yuescript☆12Mar 1, 2022Updated 4 years ago
- ☆20Nov 28, 2025Updated 5 months ago
- python interface for mlc chat cli☆14May 7, 2023Updated 3 years ago
- This tool can be used to convert binary xml file into human readable xml file and to convert normal xml file into binary xml file.☆13Feb 7, 2024Updated 2 years ago
- No Trialware☆10Mar 5, 2024Updated 2 years ago
- ik_llama.cpp's Thireus fork with release builds for macOS/Windows/Ubuntu CPU, Vulkan and CUDA☆138Updated this week
- A decentralized, k-ordered id generation service in golang☆15Jul 30, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Soluções para o Advent of Code 2022☆12Jan 2, 2023Updated 3 years ago
- Streaming CSV parser with no dependencies and has a batch event for lower memory processing in batches as well as a reducer for doing agg…☆13Aug 2, 2025Updated 9 months ago
- Make Twitch.tv look more like the old Twitch before the CSS changes.☆13Feb 18, 2021Updated 5 years ago
- Convert CocosBuilder exported .ccbi to Editable .ccb file format☆16Jan 22, 2017Updated 9 years ago
- create your own game-engine with just lua for game boy advance☆12May 22, 2025Updated last year
- ☆11Jun 21, 2023Updated 2 years ago
- ☆48May 18, 2026Updated last week
- ☆20May 30, 2025Updated 11 months ago
- ☆22Jun 13, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16May 7, 2018Updated 8 years ago
- a self improving operating system for intelligence☆60Updated this week
- Just a place to dump some project binaries. No code here.☆18Apr 16, 2026Updated last month
- Recover JPEG pictures when header is lost (corrupted, encrypted...)☆22May 31, 2017Updated 8 years ago
- Structured, temporal memory for AI agents.☆79May 18, 2026Updated last week
- A solution for mounting 9mm and 6mm gt2 belts to mgn12 carriage with M2-SHCS or 2mm pin.☆10Jan 28, 2025Updated last year
- Spreadsheets☆102Mar 17, 2026Updated 2 months ago
- Go implementation of the LIFX bulb protocol, including a command line client, a client library, and a debug-oriented traffic snooper☆18Dec 15, 2015Updated 10 years ago
- ComfyUI nodes for transcription on audio or video input.☆35Apr 23, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Golang SDK for Google actions☆11Jun 29, 2018Updated 7 years ago
- Oauth library for Lithium☆21May 17, 2013Updated 13 years ago
- PocketLang binding for Nim☆21Jul 19, 2024Updated last year
- ☆17Jun 14, 2018Updated 7 years ago
- Lightning fast ⚡ Responsive SVG candlestick chart generation in Javascript☆24Mar 8, 2022Updated 4 years ago
- ☆11Mar 25, 2023Updated 3 years ago
- Node.js wrapper for the Optimizely API☆10Mar 12, 2018Updated 8 years ago