llama.cpp fork with TQ3_1S/4S CUDA kernels — 3.5-bit WHT quantization achieving Q4s quality at 10% smaller size. Based on RaBitQ-inspired Walsh-Hadamard transform. Enables 27B models on 16GB GPUs with 15 tok/s TG, 221 tok/s PP.
☆160Apr 24, 2026Updated last week
Alternatives and similar repositories for llama.cpp-tq3
Users that are interested in llama.cpp-tq3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture☆27Feb 3, 2026Updated 3 months ago
- A collection of various technical indicators implemented in LitScript☆10Apr 5, 2022Updated 4 years ago
- Unofficial Placement information bulletin for Manipal Institute of Technology☆10Jan 5, 2024Updated 2 years ago
- ☆23Aug 27, 2025Updated 8 months ago
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Mar 29, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆30Dec 7, 2025Updated 4 months ago
- ik_llama.cpp's Thireus fork with release builds for macOS/Windows/Ubuntu CPU, Vulkan and CUDA☆122Updated this week
- SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data For High-Stakes Domains (EMNLP 2025 System Demonstration)☆27Nov 3, 2025Updated 6 months ago
- ☆19Nov 28, 2025Updated 5 months ago
- Personal Knowledge Graph - User Memory and Personality from Digital Footprint☆24Mar 12, 2026Updated last month
- No Trialware☆10Mar 5, 2024Updated 2 years ago
- DP-HyperparamTuning offers an array of tools for fast and easy hypertuning of various hyperparameters for the DP-SGD algorithm.☆23Sep 27, 2021Updated 4 years ago
- ☆47Apr 26, 2026Updated last week
- ☆128Dec 23, 2025Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A simple Gradio WebUI for loading/unloading models and loras in tabbyAPI.☆20Nov 21, 2024Updated last year
- ☆11Jun 21, 2023Updated 2 years ago
- ☆20May 30, 2025Updated 11 months ago
- ☆17May 8, 2025Updated 11 months ago
- buju (布局) is a simple layout engine, it is a Nim port of layout.h.☆14Feb 6, 2026Updated 3 months ago
- ☆22Jun 13, 2024Updated last year
- ☆16Mar 2, 2022Updated 4 years ago
- a self improving operating system for intelligence☆60Updated this week
- High-Resolution Differential Z-Belt Mod for V0 (with optional Kirigami support)☆12May 22, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆10Jan 22, 2023Updated 3 years ago
- Structured, temporal memory for AI agents.☆73Apr 8, 2026Updated 3 weeks ago
- Prompt Jinja2 templates for LLMs☆35Jul 9, 2025Updated 9 months ago
- PocketLang binding for Nim☆21Jul 19, 2024Updated last year
- Lightning fast ⚡ Responsive SVG candlestick chart generation in Javascript☆24Mar 8, 2022Updated 4 years ago
- ☆11Mar 25, 2023Updated 3 years ago
- A mount for a standard 3x15mm cartridge thermistor on 1515 T-Slot Extrusion☆10Feb 23, 2023Updated 3 years ago
- ☆38Feb 18, 2025Updated last year
- ☆13Aug 10, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The Official MIndsDB Extension for Docker Desktop.☆23Apr 23, 2026Updated last week
- A tool for the Jubilee printer using a Voron afterburner extruder☆10Jul 1, 2022Updated 3 years ago
- ☆18Dec 27, 2025Updated 4 months ago
- The Crossbar is an opensource microscale cantilever 3D printer.☆12Apr 1, 2022Updated 4 years ago
- Steamroller is a custom adapter to use the OMG extruder on Voron and other printers.☆12Oct 30, 2023Updated 2 years ago
- ☆13Jan 15, 2025Updated last year
- ☆27Feb 12, 2026Updated 2 months ago