Generate a llama-quantize command to copy the quantization parameters of any GGUF
☆31Jan 23, 2026Updated last month
Alternatives and similar repositories for quant_clone
Users that are interested in quant_clone are comparing it to the libraries listed below
Sorting:
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆24Sep 1, 2025Updated 6 months ago
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- Offline-first, desktop AI assistant tailored for educators, enabling them to generate questions directly from source materials.☆23Aug 2, 2025Updated 7 months ago
- ☆21Dec 22, 2024Updated last year
- Surgically de-slop LLMs☆14Jun 1, 2025Updated 9 months ago
- MCP server for GNU Radio☆33Jan 5, 2026Updated 2 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆49Oct 29, 2025Updated 4 months ago
- A one-file Ollama CLI client written in bash☆30Sep 7, 2025Updated 6 months ago
- An IRCd in pure Bash.☆19Jan 23, 2026Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated 10 months ago
- Useful Nautilus Scripts made with☆13Jul 27, 2021Updated 4 years ago
- ☆18Feb 7, 2026Updated last month
- Orchestration middleware for Home Assistant + Ollama: enables 8-20B models to handle complex multi-intent commands through intelligent ta…☆24Feb 6, 2026Updated last month
- ☆12Aug 17, 2024Updated last year
- A bot that provides Youtube vid chapters on Twitter (a.k.a. X )☆12Feb 5, 2025Updated last year
- Local modular AI assistant with speech, vision, and robotics support. Uses Qwen3-VL-4B in LM Studio.☆52Jan 9, 2026Updated 2 months ago
- Better Encrypted Datastore is a library for securely storing encrypted data inside Datastore. In addition, the library extends Datastore'…☆11Mar 23, 2025Updated 11 months ago
- An agent that can run everywhere - even in your watch!☆30Mar 5, 2026Updated 2 weeks ago
- A browserless HTML testing library for Python.☆20Dec 3, 2025Updated 3 months ago
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆18May 3, 2024Updated last year
- ☆15Feb 1, 2025Updated last year
- A realtime speech to text diarization system to gather and interleave speech from multiple speaker audio.☆27Jan 29, 2026Updated last month
- ☆19Jun 11, 2025Updated 9 months ago
- ☆54Oct 10, 2025Updated 5 months ago
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- AI Assistant☆20Feb 21, 2026Updated 3 weeks ago
- Evaluating LLMs by having them play games against each other☆23Sep 9, 2025Updated 6 months ago
- ☆83Feb 28, 2025Updated last year
- UnityKit - Unity3D in Swift - Pattern replicate using SceneKit☆11Oct 19, 2025Updated 5 months ago
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆41Apr 5, 2025Updated 11 months ago
- LLM Powered Social Media Simulator☆10Mar 29, 2025Updated 11 months ago
- Persys server. All services you need to run Persys locally.☆18Feb 5, 2025Updated last year
- Create your Mii and discover alone or with friends the Wuhu Island !☆15Jun 5, 2025Updated 9 months ago
- A powerful non-custodial multi-wallet for PirateCash, Cosanta, Bitcoin, Ethereum, Binance Smart Chain, Avalanche, Solana and other blockc…☆29Updated this week
- PrompFlower 1.0 is a command-line tool that generates AI prompts entirely locally and offline using a local AI engine via Ollama.☆12Mar 18, 2025Updated last year
- An MCP-enabled Qwen3 0.6B demo with adjustable thinking budget, all in your browser!☆29Jun 2, 2025Updated 9 months ago
- ☆12Dec 16, 2024Updated last year
- A wireframe shader for Unity. Also works as an screen image effect☆14Dec 27, 2019Updated 6 years ago
- Deploying Open-WebUI with Ollama using Docker Compose. Automatically loads deepseek-r1:8b model.☆16Feb 1, 2025Updated last year