raketenkater/llm-server

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/raketenkater/llm-server)

raketenkater / llm-server

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alternative for multi-GPU rigs.

☆226

Alternatives and similar repositories for llm-server

Users that are interested in llm-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

marinabox / marinabox-sandbox
View on GitHub
This repo contains all the code necessary to build the docker images for the browser and desktop sandbox
☆18Dec 2, 2025Updated 6 months ago
ikantkode / qwen3-2b-ocr-app
View on GitHub
A simple streamlit app to play with qwen3-2b-VL to perform OCR. Dockerized set up, tested with 3060 12 GB.
☆32Nov 23, 2025Updated 6 months ago
blankarrayy / ocrbro
View on GitHub
ocrbro is a dedicated light-weight n8n node which does OCR for simple Images & PDF's
☆19Apr 3, 2026Updated 2 months ago
sendbird-graveyard / mymessenger_tutorial
View on GitHub
☆10Dec 31, 2015Updated 10 years ago
loscrossos / core_zonos
View on GitHub
Windows, Linux and Mac. Fully accelerated on CUDA and MPS
☆17Jun 27, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bipark / go-opencv-caffe-facedetect-blur
View on GitHub
Go & OpenCV & Caffe 기반 얼굴인식 & 얼굴 블러 처리하기
☆13Sep 29, 2019Updated 6 years ago
lxe / yapyap
View on GitHub
fast and simple push to talk dictation
☆47Sep 22, 2025Updated 8 months ago
5Hyeons / StyleTTS2-Vocos
View on GitHub
StyleTTS2 + Vocos as a Decoder
☆13Mar 24, 2025Updated last year
matteoserva / GraphLLM
View on GitHub
☆212Jan 5, 2026Updated 5 months ago
shihwesley / SamPlaysBaseball
View on GitHub
Pitcher mechanics analyzer: single-camera video → 3D biomechanical analysis using SAM 3D Body. Built for MLB player development.
☆50Apr 9, 2026Updated 2 months ago
cloudflareresearch / unweight-kernels
View on GitHub
Lossless compression of BF16 MLP weights for LLM inference on NVIDIA Hopper GPUs
☆54Apr 17, 2026Updated 2 months ago
TigreGotico / chatterbox-onnx
View on GitHub
chatterbox TTS + Voice Clone using onnx
☆28Updated this week
inworld-ai / inworld-api-examples
View on GitHub
Inworld API Examples
☆59Updated this week
Baronco / GenFilesMCP
View on GitHub
GenFilesMCP: Minimal MCP Server for Open Web UI. Generates PPTX, XLSX, DOCX or MD files using user requests and full chat context. *Pul…
☆80May 19, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Zert / gproc-tutorial
View on GitHub
Erlang GProc Tutorial
☆39Aug 24, 2012Updated 13 years ago
Sompote / tiger_cowork
View on GitHub
A self-hosted AI workspace unifying chat, code execution, parallel multi-agent orchestration, and project management. Each agent runs on …
☆60May 7, 2026Updated last month
Apauto-to-all / GPT-soVITS-Inference-batchTool
View on GitHub
这是一个批量推理工具，对同一段文字进行多次推理，并且支持随机参数，直到筛选出最满意的结果。
☆11Aug 19, 2024Updated last year
cloudshipai / cartograph
View on GitHub
Visual codebase mapping plugin for OpenCode - auto-generates architecture diagrams as you code
☆37Jan 5, 2026Updated 5 months ago
TengHu / Interactive-RAG
View on GitHub
☆15Sep 10, 2023Updated 2 years ago
jaywyawhare / GigaVector
View on GitHub
A fast vector database written in C.
☆41Updated this week
ComposioHQ / open-poke
View on GitHub
☆46May 8, 2026Updated last month
darrenhinde / Opencode-skills-example
View on GitHub
Opencode-skil
☆43Jan 12, 2026Updated 5 months ago
montraydavis / AIDocumentRAG
View on GitHub
A full-stack document management and AI chat application that enables users to upload, manage, and chat with their documents using AI. Bu…
☆16Aug 10, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
calmstate / VisualTagger
View on GitHub
Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…
☆11Oct 28, 2024Updated last year
Conv-AI / Reallusion-web
View on GitHub
☆18Oct 21, 2024Updated last year
ganeshnikhil / Kgraph
View on GitHub
generate informative knowledge graph from text using open source models , ollama
☆23Sep 1, 2025Updated 9 months ago
JustLABv1 / justflow
View on GitHub
Workflow Automation Platform
☆12May 29, 2026Updated 2 weeks ago
houtianze / audiobook-generator
View on GitHub
☆15Mar 18, 2026Updated 3 months ago
NafisRayan / AI-Voice-Assistant-ST
View on GitHub
AI voice assistant made with Streamlit python and powered by Gemini, Mistral and PHI-3. This is a virtual assistant application built in …
☆13Aug 26, 2024Updated last year
jorge-menjivar / reactive-agents
View on GitHub
Create and improve AI agents that get better over time with automatic optimization and continuous learning
☆43Apr 2, 2026Updated 2 months ago
houtini-ai / houtini-lm
View on GitHub
MCP server that saves Claude Code tokens by delegating bounded tasks to local or cloud LLMs. Works with LM Studio, Ollama, vLLM, DeepSeek…
☆91Jun 9, 2026Updated last week
kdrkdrkdr / lilac
View on GitHub
✨Realtime Voice Changer with 3~ seconds for custom voice in CPU
☆20Apr 21, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
open-uem / openuem-docker
View on GitHub
Repository containing the docker compose file to run OpenUEM in a container environment
☆28Mar 11, 2026Updated 3 months ago
PublicDotCom / claw-skill-public-dot-com
View on GitHub
☆43May 22, 2026Updated 3 weeks ago
Houseofmvps / ultraship
View on GitHub
"ULTRASHIP" Claude Code plugin — 39 skills, 33 tools, 11 agents for ship-ready workflows: planning, review, pentesting, safety guardrails…
☆106Updated this week
Pipelex / pipelex-cookbook
View on GitHub
Cookbook for Pipelex, the declarative language for composable Al workflows. Devtool for agents and mere humans.
☆38Jun 9, 2026Updated last week
conor-is-my-name / Headful-Chrome-Remote-Puppeteer
View on GitHub
☆83May 7, 2025Updated last year
ttulttul / ComfyUI-FlowMatching-Upscaler
View on GitHub
An upscaler node for flow-matching models like Qwen, applying the DemoFusion approach
☆60Jan 29, 2026Updated 4 months ago
SocAIty / Retrieval-based-Voice-Conversion-FastAPI
View on GitHub
Adds a web API to RVC to infer via json requests
☆32Jul 9, 2024Updated last year