Generate a llama-quantize command to copy the quantization parameters of any GGUF
☆35Apr 20, 2026Updated last month
Alternatives and similar repositories for quant_clone
Users that are interested in quant_clone are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆25Sep 1, 2025Updated 8 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 11 months ago
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- Implements harmful/harmless refusal removal using pure HF Transformers☆21May 8, 2025Updated last year
- Offline-first, desktop AI assistant tailored for educators, enabling them to generate questions directly from source materials.☆24Aug 2, 2025Updated 9 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆21Dec 22, 2024Updated last year
- Surgically de-slop LLMs☆14Jun 1, 2025Updated 11 months ago
- ☆44Jun 27, 2025Updated 10 months ago
- MCP server for GNU Radio☆41Mar 31, 2026Updated last month
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆52Oct 29, 2025Updated 6 months ago
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆27Mar 8, 2025Updated last year
- This is a Python package to add tool calling capabilities to newly released LLMs on LangChain's ChatOpenAI, AzureAIChatCompletionsModel a…☆118Jun 4, 2025Updated 11 months ago
- Make AI Free Again - Shard is a GUI for Ollama LLM's. Powered by Next.js.☆63Feb 28, 2025Updated last year
- Python console application designed to provide an engaging and visually appealing LLM chat experience on Unix-like consoles or Terminals.☆25May 12, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An IRCd in pure Bash.☆20Jan 23, 2026Updated 3 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated last year
- Useful Nautilus Scripts made with☆13Jul 27, 2021Updated 4 years ago
- Orchestration middleware for Home Assistant + Ollama: enables 8-20B models to handle complex multi-intent commands through intelligent ta…☆24Apr 3, 2026Updated last month
- ☆12Aug 17, 2024Updated last year
- The accompany backend for PAI app☆12Mar 24, 2025Updated last year
- ☆12May 30, 2025Updated 11 months ago
- Send images, captions and text to Telegram channels and DM's from comfyui☆12Apr 22, 2024Updated 2 years ago
- A bot that provides Youtube vid chapters on Twitter (a.k.a. X )☆12Feb 5, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Local modular AI assistant with speech, vision, and robotics support. Uses Qwen3-VL-4B in LM Studio.☆53Jan 9, 2026Updated 4 months ago
- Better Encrypted Datastore is a library for securely storing encrypted data inside Datastore. In addition, the library extends Datastore'…☆13Mar 23, 2025Updated last year
- An agent that can run everywhere - even in your watch!☆33Apr 8, 2026Updated last month
- Awesome AI Benchmarks☆32Jan 16, 2026Updated 4 months ago
- A privacy-focused, censorship-resistant multinet Android radio player built with Claude Code. Supports I2P, clearnet and Tor streaming.☆52May 11, 2026Updated last week
- A browserless HTML testing library for Python.☆22Dec 3, 2025Updated 5 months ago
- ☆16Feb 1, 2025Updated last year
- ☆19Jun 11, 2025Updated 11 months ago
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆19May 3, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆48Feb 18, 2026Updated 3 months ago
- ☆57Oct 10, 2025Updated 7 months ago
- win32 native frontend for llama-cli☆14Nov 2, 2024Updated last year
- AI Assistant☆20Feb 21, 2026Updated 2 months ago
- ☆11Feb 15, 2025Updated last year
- ☆82Feb 28, 2025Updated last year
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆44Apr 5, 2025Updated last year