Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆18Jan 10, 2025Updated last year
Alternatives and similar repositories for llama-gguf-optimize
Users that are interested in llama-gguf-optimize are comparing it to the libraries listed below.
- Firebase REST Client for Node.js☆11Jun 2, 2016Updated 9 years ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆53Mar 5, 2026Updated 3 weeks ago
- Multi-turn dataset management tool for LLM trainers☆12Mar 31, 2025Updated 11 months ago
- ☆17Dec 16, 2024Updated last year
- Automates the creation of full-text (sound and text) ebooks in epub/epub3/daisy format, the webserver/client creates smil files to sync a…☆10Nov 12, 2021Updated 4 years ago
- A comprehensive WebUI Toolkit for Resemble-AI's Chatterbox☆23Jun 7, 2025Updated 9 months ago
- Simple node proxy for llama-server that enables MCP use☆19May 10, 2025Updated 10 months ago
- AI Search engine☆13Sep 24, 2025Updated 6 months ago
- Quick access to any large language model from your browser.☆10Feb 16, 2026Updated last month
- Enterprise-ready vector database toolkit for building searchable knowledge bases from multiple data sources. Supports multi-project manag…☆34Updated this week
- CLI for quickly generating citations for websites and books☆19Nov 14, 2018Updated 7 years ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆38Jul 2, 2025Updated 8 months ago
- Proof-of-Concept Research Artifact: Un-concealing Llama 3.2 capabilities☆53Dec 15, 2025Updated 3 months ago
- Get aid from local LLMs right in your PowerShell☆15May 2, 2025Updated 10 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- Modified from the official CosyVoice, with the overall interaction adapted to the CosyVoice2 model; works out of the box☆22Jun 15, 2025Updated 9 months ago
- API server for VibeVoice☆27Sep 28, 2025Updated 5 months ago
- ☆50Jul 23, 2025Updated 8 months ago
- 🤖 302 AI Patent Search! 🚀✨☆16Aug 26, 2025Updated 7 months ago
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆99Feb 15, 2026Updated last month
- ☆16Aug 19, 2023Updated 2 years ago
- Google Meet bot deployed on DigitalOcean that joins meetings from Google Calendar and records audio and transcription.☆10Aug 4, 2021Updated 4 years ago
- AI-optimized knowledge base for building applications with go-zero☆59Feb 28, 2026Updated 3 weeks ago
- OpenAI compatible API for Dia-1.6B☆36Apr 27, 2025Updated 11 months ago
- 💻🤖 302 AI Prompt Expert! 🚀✨☆25Aug 26, 2025Updated 7 months ago
- Enhanced CosyVoice with one-click Windows installer, voice management WebUI, and a vLLM-accelerated OpenAI TTS API.☆23Aug 3, 2025Updated 7 months ago
- Medusa combo files, Hashcat rules and dictionaries, JRT rules☆14Oct 20, 2022Updated 3 years ago
- mov2mov extension for AUTOMATIC1111/stable-diffusion-webui☆27Sep 13, 2023Updated 2 years ago
- The Complete Kokoro TTS API is a production-grade text-to-speech server with robust text processing, zero-default audio effects, and loca…☆19Sep 26, 2025Updated 6 months ago
- llama-swap + a minimal ollama compatible api☆54Mar 14, 2026Updated 2 weeks ago
- Streaming media server built on janus-gateway.☆11May 15, 2024Updated last year
- Analyze Reddit posts☆30Feb 27, 2025Updated last year
- Fetches and parses ARC69 metadata for Algorand NFTs☆21Jan 5, 2022Updated 4 years ago
- The official signum-network miner☆19Feb 9, 2026Updated last month
- Personalized Virtual Webcam for WebRTC☆18Updated this week
- ONLYOFFICE Document Server is an online office suite comprising viewers and editors for texts, spreadsheets and presentations, fully comp…☆23May 5, 2020Updated 5 years ago
- Curated list of awesome plugins for ChatGPT☆15Mar 23, 2023Updated 3 years ago
- A Terminal Chat Application that makes you 'shut up' heuheuheu :)☆29Apr 12, 2021Updated 4 years ago
- Available Terraform Provider network mirroring service.☆48Nov 20, 2025Updated 4 months ago