llama.cpp fork with TQ3_1S/4S CUDA kernels: 3.5-bit WHT quantization that achieves Q4s quality at a 10% smaller size. Based on a RaBitQ-inspired Walsh-Hadamard transform. Enables 27B models on 16GB GPUs at 15 tok/s TG (token generation) and 221 tok/s PP (prompt processing).
☆190Jun 10, 2026Updated this week
Alternatives and similar repositories for llama.cpp-tq3
Users that are interested in llama.cpp-tq3 are comparing it to the libraries listed below.
- A conda-smithy repository for cling.☆12Apr 23, 2026Updated last month
- A program to automate testing open source LLMs for their political compass scores☆12Nov 28, 2023Updated 2 years ago
- ☆21Nov 28, 2025Updated 6 months ago
- Node to use PyTexturePacker☆23Feb 3, 2025Updated last year
- This tool converts binary XML files into human-readable XML files and normal XML files back into binary XML files.☆13Feb 7, 2024Updated 2 years ago
- ik_llama.cpp's Thireus fork with release builds for macOS/Windows/Ubuntu CPU, Vulkan and CUDA☆148Updated this week
- An alternative lightweight ChatGPT client that uses a cost-effective API and eliminates the risk of API token leak by running from an htm…☆13Jan 13, 2026Updated 5 months ago
- ☆128Dec 23, 2025Updated 5 months ago
- Make Twitch.tv look more like the old Twitch before the CSS changes.☆13Feb 18, 2021Updated 5 years ago
- Convert CocosBuilder exported .ccbi to Editable .ccb file format☆18Jan 22, 2017Updated 9 years ago
- A simple Gradio WebUI for loading/unloading models and loras in tabbyAPI.☆20Nov 21, 2024Updated last year
- ☆11Jun 21, 2023Updated 2 years ago
- ☆22Jun 13, 2024Updated 2 years ago
- ☆16May 7, 2018Updated 8 years ago
- High-Resolution Differential Z-Belt Mod for V0 (with optional Kirigami support)☆12May 22, 2022Updated 4 years ago
- TTS support with GGML☆242Oct 5, 2025Updated 8 months ago
- Standalone Implementation of abx2xml and xml2abx☆37Dec 7, 2025Updated 6 months ago
- ff-addon: Enables user-defined "natural mouse combinations" to trigger customizable functions.☆15Oct 21, 2016Updated 9 years ago
- ☆10Jan 22, 2023Updated 3 years ago
- ☆70Updated this week
- Recover JPEG pictures when header is lost (corrupted, encrypted...)☆22May 31, 2017Updated 9 years ago
- Structured, temporal memory for AI agents.☆85May 18, 2026Updated 3 weeks ago
- A solution for mounting 9mm and 6mm gt2 belts to mgn12 carriage with M2-SHCS or 2mm pin.☆10Jan 28, 2025Updated last year
- Prompt Jinja2 templates for LLMs☆35Jul 9, 2025Updated 11 months ago
- Wheels for llama-cpp-python compiled with cuBLAS support☆27Apr 9, 2025Updated last year
- Hugging Face Download (Cache) Manager☆22Aug 7, 2022Updated 3 years ago
- ☆11Mar 25, 2023Updated 3 years ago
- Reverse image search utility based on perceptual hash algorithms☆23Nov 4, 2016Updated 9 years ago
- A mount for a standard 3x15mm cartridge thermistor on 1515 T-Slot Extrusion☆10Feb 23, 2023Updated 3 years ago
- The official MindsDB extension for Docker Desktop.☆23Apr 23, 2026Updated last month
- A tool for the Jubilee printer using a Voron afterburner extruder☆10Jul 1, 2022Updated 3 years ago
- ☆18Dec 27, 2025Updated 5 months ago
- The Crossbar is an open-source microscale cantilever 3D printer.☆12Apr 1, 2022Updated 4 years ago
- ☆30Updated this week
- Steamroller is a custom adapter to use the OMG extruder on Voron and other printers.☆12Oct 30, 2023Updated 2 years ago
- ☆13Jan 15, 2025Updated last year
- SB toolhead, perfectly adapted to the Orbiter 2.0☆10Aug 25, 2022Updated 3 years ago
- [Self-hosted] A Model Context Protocol (MCP) server implementation that provides a web search capability over stdio transport. This serve…☆36Apr 30, 2025Updated last year
- NoteBookLM wasn't good enough, so I built my own! Newbee-Notebook (新蜂阅读器): a better reading experience and AI interaction mechanism, under continuous development☆85Jun 7, 2026Updated last week