Thireus / ik_llama.cpp
llama.cpp fork with additional SOTA quants and improved performance
☆26Updated this week
Alternatives and similar repositories for ik_llama.cpp
Users who are interested in ik_llama.cpp are comparing it to the libraries listed below
- Croco.Cpp is a fork of KoboldCPP inferring GGML/GGUF models on CPU/CUDA with KoboldAI's UI. It's powered partly by IK_LLama.cpp, and compati…☆135Updated this week
- win32 native frontend for llama-cli☆12Updated 9 months ago
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆27Updated 3 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆73Updated 9 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆28Updated 3 weeks ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆22Updated 4 months ago
- Input your VRAM and RAM and the toolchain will produce a GGUF model tuned to your system within seconds — flexible model sizing and lowes…☆33Updated this week
- ☆82Updated this week
- Stable Diffusion and Flux in pure C/C++☆21Updated 2 weeks ago
- SoTA open-source TTS☆69Updated this week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated last year
- A pipeline parallel training script for LLMs.☆154Updated 4 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆42Updated 3 months ago
- Easily view and modify JSON datasets for large language models☆81Updated 3 months ago
- ☆23Updated 10 months ago
- Comprehensive image resizing capabilities for ComfyUI. Scale by a specific ratio, scale to target megapixels, scale to fixed dimensions. …☆38Updated last month
- ☆86Updated 4 months ago
- Privacy-first agentic framework with powerful reasoning & task automation capabilities. Natively distributed and fully ISO 27XXX complian…☆66Updated 4 months ago
- ☆95Updated last week
- ☆50Updated 6 months ago
- LLM backed Fantasy Tribe Game☆19Updated 9 months ago
- Core, Junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs☆52Updated 3 months ago
- ☆121Updated 9 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 9 months ago
- automatically quant GGUF models☆196Updated this week
- This is a pre-built wheel of Triton 3.3.0 for Windows with Nvidia only + Proton☆29Updated 3 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆80Updated 10 months ago
- Deploy Apollo HF space locally☆40Updated 8 months ago
- Python package wrapping llama.cpp for on-device LLM inference☆85Updated last month
- deep hermes, but decides how to respond based on its OWN decision, no need for system prompts.☆40Updated 4 months ago