k-koehler / gguf-tensor-overriderLinks
β48Updated 2 weeks ago
Alternatives and similar repositories for gguf-tensor-overrider
Users that are interested in gguf-tensor-overrider are comparing it to the libraries listed below
Sorting:
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backendsβ48Updated 2 months ago
- π FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GPβ¦β39Updated this week
- β28Updated 4 months ago
- β84Updated 3 weeks ago
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.β226Updated this week
- β104Updated 2 months ago
- β168Updated 2 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differeβ¦β81Updated last week
- Run Orpheus 3B Locally With LM Studioβ31Updated 7 months ago
- llama-swap + a minimal ollama compatible apiβ30Updated last week
- Python language chat with Ollama models locally, anthropic and openaiβ24Updated 6 months ago
- β83Updated 8 months ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.β80Updated 3 weeks ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.β206Updated 5 months ago
- β51Updated 8 months ago
- Sparse Inferencing for transformer based LLMsβ201Updated 2 months ago
- β56Updated 8 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.β165Updated last year
- A web application that converts speech to speech 100% privateβ77Updated 4 months ago
- β206Updated last month
- Automated speech dataset creatorβ204Updated 4 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!β28Updated 2 months ago
- Privacy-first agentic framework with powerful reasoning & task automation capabilities. Natively distributed and fully ISO 27XXX complianβ¦β66Updated 6 months ago
- Eternal is an experimental platform for machine learning models and workflows.β67Updated 7 months ago
- β180Updated last month
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech Gβ¦β25Updated 7 months ago
- Personal voice assistant, with voice interruption and Twilio supportβ18Updated 8 months ago
- Code for Papeg.aiβ225Updated 9 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.β262Updated 7 months ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorerβ255Updated last week