SamuelTallet / alpine-llama-cpp-server
A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux.
☆27 Updated last month
Alternatives and similar repositories for alpine-llama-cpp-server
Users who are interested in alpine-llama-cpp-server are comparing it to the repositories listed below.
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆67 Updated this week
- A real-time shared memory layer for multi-agent LLM systems. ☆26 Updated this week
- Light WebUI for lm.rs ☆23 Updated 8 months ago
- *NIX SHELL with Local AI/LLM integration ☆23 Updated 4 months ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models ☆20 Updated 2 months ago
- Generate a wiki for your research topic, sourcing from the web and your docs. ☆46 Updated 3 months ago
- ☆28 Updated 2 weeks ago
- Run and manage MCP servers as Docker containers with a unified HTTP endpoint. Inspired by Docker compose. ☆35 Updated this week
- The DPAB-α Benchmark ☆25 Updated 5 months ago
- A RAG system designed to process documents with multimodal content. It can generate factual, context-aware answers to user queries, based… ☆21 Updated 6 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆76 Updated 9 months ago
- ☆24 Updated 5 months ago
- A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX. ☆86 Updated last week
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools. ☆32 Updated 2 weeks ago
- ☆17 Updated last month
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control. ☆27 Updated last month
- A lightweight, open-source blueprint for building powerful and scalable LLM chat applications ☆28 Updated last year
- Powerful and fast tool calling agents ☆49 Updated 3 months ago
- Personal voice assistant, with voice interruption and Twilio support ☆17 Updated 4 months ago
- George is an API leveraging AI to make it easy to control a computer with natural language. ☆48 Updated 5 months ago
- ☆104 Updated last month
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP… ☆21 Updated 2 weeks ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full … ☆96 Updated 2 months ago
- ☆130 Updated 2 months ago
- Eternal is an experimental platform for machine learning models and workflows. ☆68 Updated 3 months ago
- Convert URLs into LLM-friendly markdown chunks ☆64 Updated 9 months ago
- A bot that checks your grammar and phrasing using LLM of choice ☆30 Updated 4 months ago
- Super simple Python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run! ☆25 Updated last month
- Minimal Linux OS with a Model Context Protocol (MCP) gateway to expose local capabilities to LLMs. ☆247 Updated this week
- A Multi-Agentic AI Assistant/Builder ☆14 Updated this week