Efficient non-uniform quantization with GPTQ for GGUF
☆60Sep 17, 2025Updated 5 months ago
Alternatives and similar repositories for gptq-gguf-toolkit
Users that are interested in gptq-gguf-toolkit are comparing it to the libraries listed below
Sorting:
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Updated this week
- ☆15Apr 9, 2025Updated 10 months ago
- This Streamlit application allows users to upload images and engage in interactive conversations about them using the Ollama Vision Model…☆15Nov 11, 2024Updated last year
- Llama.cpp-qt is a Python-based GUI wrapper for the LLama.cpp server, providing a user-friendly interface for configuring and running the …☆16Oct 4, 2023Updated 2 years ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 4 months ago
- ☆37Jul 4, 2025Updated 8 months ago
- ☆29Feb 24, 2025Updated last year
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆29Feb 4, 2025Updated last year
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Thin wrapper around GGML to make life easier☆42Nov 5, 2025Updated 4 months ago
- Genertaes control vectors for use with llama.cpp in GGUF format.☆38Mar 19, 2025Updated 11 months ago
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆56Feb 25, 2026Updated last week
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)☆30Feb 21, 2026Updated 2 weeks ago
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- A desktop GUI for Flux 1.1 Pro built using DelphiFMX For Python☆11Oct 5, 2024Updated last year
- LexiCrawler is a powerful Go-based web crawling API meticulously designed to extract, clean, and transform web page content into a pristi…☆48Feb 27, 2025Updated last year
- ☆165Jun 22, 2025Updated 8 months ago
- Training framework with a goal to explore the frontier of sample efficiency of small language models☆98Jan 25, 2026Updated last month
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- Enemies for your LLM☆35Jan 20, 2026Updated last month
- LlamaNet: Decentralized Inference Swarm for llama.cpp☆23Jan 18, 2026Updated last month
- ☆64Jun 24, 2025Updated 8 months ago
- ☆10Sep 4, 2025Updated 6 months ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆24Feb 21, 2026Updated 2 weeks ago
- Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal☆11Jul 27, 2020Updated 5 years ago
- Kotlin library for Cortex.cpp a Local AI API Platform that is used to run and customize LLMs.☆10Apr 2, 2025Updated 11 months ago
- Convert Confluence MIME exports (.doc) to clean Markdown☆34Jan 13, 2026Updated last month
- The Treblle SDK the Django framework☆11Sep 10, 2025Updated 5 months ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆18Feb 25, 2026Updated last week
- 💀 gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents.☆15Oct 23, 2025Updated 4 months ago
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- ☆63Jul 10, 2025Updated 7 months ago
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆240Jan 26, 2026Updated last month
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- An example AWS SAM app showing how to deploy a fastai app using Lambda Container feature☆13Dec 6, 2020Updated 5 years ago
- ☆21Jul 18, 2025Updated 7 months ago
- This repo has scripts to compare various powerful RL methods☆33Feb 23, 2026Updated last week