SamuelTallet / alpine-llama-cpp-server
A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux.
☆29 Updated last month
Alternatives and similar repositories for alpine-llama-cpp-server
Users who are interested in alpine-llama-cpp-server are comparing it to the repositories listed below.
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆82 Updated last week
- George is an API leveraging AI to make it easy to control a computer with natural language. ☆50 Updated 10 months ago
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng… ☆81 Updated 3 months ago
- Enhancing LLMs with LoRA ☆174 Updated 3 weeks ago
- ☆207 Updated 2 months ago
- Open source LLM UI, compatible with all local LLM providers. ☆176 Updated last year
- A simple tool to anonymize LLM prompts. ☆65 Updated 9 months ago
- llmbasedos: Local-First OS Where Your AI Agents Wake Up and Work ☆277 Updated 2 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and… ☆51 Updated 5 months ago
- git-like rag pipeline ☆246 Updated 3 weeks ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search … ☆46 Updated 2 months ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models ☆19 Updated 6 months ago
- A platform to self-host AI on easy mode ☆173 Updated this week
- No-messing-around sh client for llama.cpp's server ☆30 Updated last year
- LocalScore is an open benchmark which helps you understand how well your computer can handle local AI tasks. ☆65 Updated 2 months ago
- An educational Rust project for exporting and running inference on Qwen3 LLM family ☆33 Updated 3 months ago
- ☆133 Updated 6 months ago
- ☆77 Updated this week
- A web application that converts speech to speech 100% private ☆77 Updated 5 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full … ☆100 Updated 2 months ago
- ☆49 Updated last month
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines ☆145 Updated last month
- The DPAB-α Benchmark ☆30 Updated 9 months ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing. ☆17 Updated 2 months ago
- Light WebUI for lm.rs ☆24 Updated last year
- ☆35 Updated last year
- ☆64 Updated 10 months ago
- Something similar to Apple Intelligence? ☆61 Updated last year
- A frontend for creative writing with LLMs ☆135 Updated last year
- ☆22 Updated 9 months ago