Generate a llama-quantize command to copy the quantization parameters of any GGUF
☆30 · Jan 23, 2026 · Updated last month
Alternatives and similar repositories for quant_clone
Users that are interested in quant_clone are comparing it to the libraries listed below
- ☆21 · Dec 22, 2024 · Updated last year
- LLM-backed Fantasy Tribe Game ☆19 · Nov 21, 2024 · Updated last year
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models: no cloud, totally local. ☆17 · May 21, 2025 · Updated 9 months ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing. ☆23 · Sep 1, 2025 · Updated 5 months ago
- Offline-first, desktop AI assistant tailored for educators, enabling them to generate questions directly from source materials. ☆23 · Aug 2, 2025 · Updated 6 months ago
- A one-file Ollama CLI client written in bash ☆30 · Sep 7, 2025 · Updated 5 months ago
- An AI Vision Language Model System for extracting structured knowledge graph information (JSON) from images of process diagrams ☆40 · Apr 5, 2025 · Updated 10 months ago
- Local modular AI assistant with speech, vision, and robotics support. Uses Qwen3-VL-4B in LM Studio. ☆51 · Jan 9, 2026 · Updated last month
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends ☆51 · Aug 21, 2025 · Updated 6 months ago
- Sesame CSM 1B Voice Cloning ☆332 · Mar 15, 2025 · Updated 11 months ago
- ☆17 · Feb 4, 2026 · Updated 3 weeks ago
- This project showcases engaging interactions between two AI chatbots. ☆10 · Jan 10, 2024 · Updated 2 years ago
- Kinematic and dynamic models of continuum and articulated soft robots. ☆15 · Nov 22, 2025 · Updated 3 months ago
- MQTT interface for Bluetti power stations ☆16 · Jun 21, 2025 · Updated 8 months ago
- Fully local, temporally aware natural language file search on your PC, even without a GPU. Find relevant files using natural language i… ☆170 · Dec 15, 2025 · Updated 2 months ago
- For individual users, watsonx Code Assistant can access a local IBM Granite model ☆37 · Jun 25, 2025 · Updated 8 months ago
- The accompanying backend for PAI app ☆12 · Mar 24, 2025 · Updated 11 months ago
- Quick hack job to allow use with SillyTavern. This works for me; some further updates are expected to expose more settings to SillyTavern ☆11 · May 30, 2024 · Updated last year
- UnityKit - Unity3D in Swift - Pattern replicate using SceneKit ☆11 · Oct 19, 2025 · Updated 4 months ago
- MCP server for GNU Radio ☆31 · Jan 5, 2026 · Updated last month
- ☆13 · Jul 11, 2022 · Updated 3 years ago
- Orchestration middleware for Home Assistant + Ollama: enables 8-20B models to handle complex multi-intent commands through intelligent ta… ☆23 · Feb 6, 2026 · Updated 3 weeks ago
- Kafka Manager Dockerfile ☆11 · Nov 22, 2017 · Updated 8 years ago
- A Simple, Explainable Vision Language Model for detecting manufacturing defects in products ☆14 · Sep 23, 2025 · Updated 5 months ago
- ☆30 · Jun 6, 2025 · Updated 8 months ago
- ☆12 · Apr 13, 2024 · Updated last year
- Startup equity calculator ☆12 · Dec 11, 2019 · Updated 6 years ago
- LaTeX document template for Final Degree Projects done at the ETSISI UPM school. ☆10 · Apr 27, 2025 · Updated 10 months ago
- Firmware for the Zaunkoenig M3K. ☆12 · Jul 25, 2025 · Updated 7 months ago
- ☆10 · Jan 23, 2025 · Updated last year
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining. ☆49 · Oct 29, 2025 · Updated 3 months ago
- A web application that converts speech to speech, 100% private ☆83 · Jun 3, 2025 · Updated 8 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and… ☆50 · May 19, 2025 · Updated 9 months ago
- ☆12 · Apr 2, 2025 · Updated 10 months ago
- A powerful AI-integrated Terminal Shell powered by the Ollama LLM interface. ☆14 · May 30, 2025 · Updated 8 months ago
- Surgically de-slop LLMs ☆14 · Jun 1, 2025 · Updated 8 months ago
- A bot that provides YouTube video chapters on Twitter (a.k.a. X) ☆12 · Feb 5, 2025 · Updated last year
- Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution. ☆16 · Aug 1, 2025 · Updated 6 months ago
- ⍺-MON anonymizes network traffic in real time. This software processes network traffic on input interfaces to remove privacy sensitive info… ☆12 · Sep 27, 2021 · Updated 4 years ago