llama.cpp fork with TQ3_1S/4S CUDA kernels — 3.5-bit WHT quantization achieving Q4s quality at 10% smaller size. Based on RaBitQ-inspired Walsh-Hadamard transform. Enables 27B models on 16GB GPUs with 15 tok/s TG, 221 tok/s PP.
☆78Apr 13, 2026Updated this week
Alternatives and similar repositories for llama.cpp-tq3
Users that are interested in llama.cpp-tq3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extending NERDA Library for Continual Learning☆11Mar 31, 2024Updated 2 years ago
- ☆29Jul 15, 2025Updated 9 months ago
- ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture☆25Feb 3, 2026Updated 2 months ago
- A collection of various technical indicators implemented in LitScript☆10Apr 5, 2022Updated 4 years ago
- A bash script used to install the requied stack for running Jesse.☆15Feb 23, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A collection of various technical indicators implemented in PineScript☆10Dec 3, 2019Updated 6 years ago
- Unofficial Placement information bulletin for Manipal Institute of Technology☆10Jan 5, 2024Updated 2 years ago
- A conda-smithy repository for cling.☆12Mar 17, 2026Updated 3 weeks ago
- The code used for the documentation website.☆15Apr 7, 2026Updated last week
- TradingLite - LitScript Documentation☆13Jan 31, 2020Updated 6 years ago
- ☆23Aug 27, 2025Updated 7 months ago
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Mar 29, 2021Updated 5 years ago
- ☆30Dec 7, 2025Updated 4 months ago
- Pinescript for VS-Code☆19Jun 10, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Developing a Toolkit for On-chain analysis of Blockchains☆17Oct 19, 2019Updated 6 years ago
- Coding Introduction / Tutorial, mostly by writing small little fun games☆24Jun 25, 2025Updated 9 months ago
- ☆19Dec 16, 2023Updated 2 years ago
- SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data For High-Stakes Domains (EMNLP 2025 System Demonstration)☆26Nov 3, 2025Updated 5 months ago
- Node to use PyTexturePacker☆23Feb 3, 2025Updated last year
- Personal Knowledge Graph - User Memory and Personality from Digital Footprint☆24Mar 12, 2026Updated last month
- A binary executable of the cling C++ REPL; built from source on Windows 10☆17Sep 5, 2021Updated 4 years ago
- AIPO (AI Product Owner) - GOALだけ伝えればAIが勝手に仕事を進める汎用問題解決システム☆66Jan 29, 2026Updated 2 months ago
- ☆125Dec 23, 2025Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- DP-HyperparamTuning offers an array of tools for fast and easy hypertuning of various hyperparameters for the DP-SGD algorithm.☆23Sep 27, 2021Updated 4 years ago
- Pyxis is an FULL client-side IDE that runs in the browser(static site) with Node.js emulator, Git, and a VSCode-like editor. Fully local…☆44Mar 28, 2026Updated 2 weeks ago
- Compression for unit-norm embedding vectors using spherical coordinates☆81Jan 23, 2026Updated 2 months ago
- Investigating attacks using Splunk Enterprise logs and creating SPL intrusion detection searches based on known attacker TTPs and anomaly…☆29Nov 19, 2023Updated 2 years ago
- AdaptKeyBERT: keyword/keyphrase extraction with zero-shot and few-shot semi-supervised domain adaptation☆26Sep 22, 2024Updated last year
- A simple Gradio WebUI for loading/unloading models and loras in tabbyAPI.☆20Nov 21, 2024Updated last year
- OBSOLETE : DO NOT USE IT (migrated to gitlab.com)☆27Oct 29, 2017Updated 8 years ago
- ☆11Jun 21, 2023Updated 2 years ago
- A backup of SmokelessRuntimeEFIPatcher☆27Jun 19, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆23Jun 22, 2020Updated 5 years ago
- ☆16Mar 2, 2022Updated 4 years ago
- AI-zhipu is an Obsidian plugin that helps you utilize the Zhipu API. 智谱AI obsidian 插件☆26Jun 13, 2024Updated last year
- Unofficial mirror of git://git.code.sf.net/p/openfoam-extend/foam-extend-3.1☆22Feb 15, 2022Updated 4 years ago
- GeneticPromptLab uses genetic algorithms for automated prompt engineering (for LLMs), enhancing quality and diversity through iterative s…☆33Jun 21, 2024Updated last year
- a self improving operating system for agents☆60Updated this week
- Structured, temporal memory for AI agents.☆66Updated this week