MaggotHATE / Llama_chatLinks
A chat UI for Llama.cpp
☆15Updated last month
Alternatives and similar repositories for Llama_chat
Users that are interested in Llama_chat are comparing it to the libraries listed below
Sorting:
- Minimalist stable-diffusion desktop application with only one executable file writen with golang ( No python ).☆18Updated 9 months ago
- Image synthesis using machine learning☆22Updated 8 months ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆21Updated 5 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆41Updated last year
- Stable Diffusion in pure C/C++☆16Updated 3 weeks ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆42Updated 6 months ago
- Llama.cpp-qt is a Python-based GUI wrapper for the LLama.cpp server, providing a user-friendly interface for configuring and running the …☆16Updated 2 years ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆48Updated 2 years ago
- Course Project for COMP4471 on RWKV☆17Updated last year
- Recording models☆12Updated 2 years ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- ☆22Updated last year
- Simple agent framework using Ollama tool calling☆10Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference☆54Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆51Updated 4 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 3 weeks ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆38Updated last year
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆42Updated last year
- Controllable Language Model Interactions in TypeScript☆10Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated last year
- ☆20Updated last year
- ☆15Updated 9 months ago
- Spotlight-like client for Ollama on Windows.☆28Updated last year
- PowerShell automation to rebuild llama.cpp for a Windows environment.☆35Updated 3 weeks ago
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Updated last year
- Experiments with BitNet inference on CPU☆55Updated last year
- Stable Diffusion in pure C/C++☆13Updated last year
- AirLLM 70B inference with single 4GB GPU☆17Updated 7 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Updated last year
- High-Performance Text Deduplication Toolkit☆61Updated 5 months ago