LostRuins / koboldcpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
☆5,290 · Updated this week
Related projects
Alternatives and complementary repositories for koboldcpp
- LLM Frontend for Power Users. ☆8,346 · Updated this week
- A Gradio web UI for Large Language Models. ☆40,696 · Updated this week
- Python bindings for llama.cpp ☆8,141 · Updated this week
- For GGUF support, see KoboldCPP: https://github.com/LostRuins/koboldcpp ☆3,522 · Updated 2 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆3,680 · Updated this week
- Atmospheric adventure chat for AI language models (KoboldAI, NovelAI, Pygmalion, OpenAI chatgpt, gpt-4) ☆2,208 · Updated last week
- Lord of Large Language Models Web User Interface ☆4,347 · Updated this week
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆2,760 · Updated last year
- Stable Diffusion and Flux in pure C/C++ ☆3,508 · Updated 3 weeks ago
- Locally run an Instruction-Tuned Chat-Style LLM ☆10,252 · Updated last year
- SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models ☆5,730 · Updated this week
- Tensor library for machine learning ☆11,233 · Updated this week
- LLM inference in C/C++ ☆68,097 · Updated this week
- Extensions API for SillyTavern. ☆565 · Updated 2 months ago
- Multi-Platform Package Manager for Stable Diffusion ☆4,783 · Updated this week
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv… ☆1,120 · Updated this week
- Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required. ☆6,919 · Updated this week
- An OAI compatible exllamav2 API that's both lightweight and fast ☆605 · Updated this week
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm. ☆4,497 · Updated last month
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath ☆9,269 · Updated 3 months ago
- AI Inferencing at the Edge. A simple one-file way to run various GGML models with KoboldAI's UI with AMD ROCm offloading ☆450 · Updated this week
- StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, … ☆4,588 · Updated 3 months ago
- The simplest way to run LLaMA on your local machine ☆13,099 · Updated 5 months ago
- ☆563 · Updated last month
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,814 · Updated 9 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆7,919 · Updated 6 months ago
- Large-scale LLM inference engine ☆1,134 · Updated this week
- ☆8,526 · Updated this week
- Official inference library for Mistral models ☆9,738 · Updated last week
- Simplified installers for oobabooga/text-generation-webui. ☆553 · Updated last year