Smart launcher for llama.cpp / ik_llama.cpp — auto-detects GPUs, optimizes MoE placement, crash recovery
☆203May 5, 2026Updated this week
Alternatives and similar repositories for llm-server
Users that are interested in llm-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A bytebot variant that uses Holo 1.5 7b to control the desktop☆25Nov 4, 2025Updated 6 months ago
- Pipeline to convert real-life chess boards into a 2D digital format(FEN) from images and live camera feeds. The system has 2 versions: o…☆57Jan 10, 2026Updated 3 months ago
- Advanced drum machine for ComfyUI featuring a 64-step sequencer, custom sample support, and retro hardware aesthetics.☆20Jan 19, 2026Updated 3 months ago
- OpenClaw Operator gives coding agents like Codex and Claude Code the context and playbooks needed to set up, validate, and troubleshoot a…☆19Mar 7, 2026Updated last month
- This repo contains all the code necessary to build the docker images for the browser and desktop sandbox☆18Dec 2, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple streamlit app to play with qwen3-2b-VL to perform OCR. Dockerized set up, tested with 3060 12 GB.☆33Nov 23, 2025Updated 5 months ago
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp…☆94Apr 12, 2026Updated 3 weeks ago
- ocrbro is a dedicated light-weight n8n node which does OCR for simple Images & PDF's☆18Apr 3, 2026Updated last month
- ☆36Apr 15, 2026Updated 3 weeks ago
- A new congestion control algorithm for LEO satellite networks.☆37Jan 22, 2026Updated 3 months ago
- fast and simple push to talk dictation☆47Sep 22, 2025Updated 7 months ago
- Self-hosted Excalidraw with persistence and multiple boards☆101Apr 7, 2026Updated 3 weeks ago
- Local-first personal memory & knowledge store for LLM agents. ローカルLLM向けの個人用メモリ/知識ストア(short / chronicle / memopedia / archive の4層構成)。☆22Mar 10, 2026Updated last month
- ☆39Sep 22, 2025Updated 7 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Cursor Talk To Figma MCP☆32Jun 9, 2025Updated 10 months ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated last year
- Visual codebase mapping plugin for OpenCode - auto-generates architecture diagrams as you code☆35Jan 5, 2026Updated 4 months ago
- Lossless compression of BF16 MLP weights for LLM inference on NVIDIA Hopper GPUs☆48Apr 17, 2026Updated 2 weeks ago
- chatterbox TTS + Voice Clone using onnx☆28Dec 31, 2025Updated 4 months ago
- GenFilesMCP: Minimal MCP Server for Open Web UI. Generates PPTX, XLSX, DOCX or MD files using user requests and full chat context. *Pul…☆77Apr 3, 2026Updated last month
- path of exile stash tab parser☆10May 26, 2021Updated 4 years ago
- A self-hosted AI workspace unifying chat, code execution, parallel multi-agent orchestration, and project management. Each agent runs on …☆55Apr 27, 2026Updated last week
- Personal Finance Expense Tracker☆19Nov 14, 2025Updated 5 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Zippy Talking Avatar uses Azure Cognitive Services and OpenAI API to generate text and speech. It is built with Next.js and Tailwind CSS.…☆16Feb 9, 2024Updated 2 years ago
- Cookbook for Pipelex, the declarative language for composable Al workflows. Devtool for agents and mere humans.☆35Updated this week
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆11Oct 28, 2024Updated last year
- AI voice assistant made with Streamlit python and powered by Gemini, Mistral and PHI-3. This is a virtual assistant application built in …☆13Aug 26, 2024Updated last year
- Workflow Automation Platform☆12Mar 29, 2026Updated last month
- Anthropic-compatible HTTP facade over claude-agent-acp☆90Updated this week
- ☆15Mar 18, 2026Updated last month
- MCP server that saves Claude Code tokens by delegating bounded tasks to local or cloud LLMs. Works with LM Studio, Ollama, vLLM, DeepSeek…☆80Apr 21, 2026Updated 2 weeks ago
- "ULTRASHIP" Claude Code plugin — 39 skills, 33 tools, 11 agents for ship-ready workflows: planning, review, pentesting, safety guardrails…☆61Apr 18, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Repository containing the docker compose file to run OpenUEM in a container environment☆23Mar 11, 2026Updated last month
- JANG — GGUF for MLX. YOU MUST USE JANG_Q RUNTIME. Adaptive Mixed-Precision Quantization + Runtime for Apple Silicon☆143Updated this week
- ☆43Feb 11, 2026Updated 2 months ago
- ☆83May 7, 2025Updated last year
- An upscaler node for flow-matching models like Qwen, applying the DemoFusion approach☆60Jan 29, 2026Updated 3 months ago
- Adds a web API to RVC to infer via json requests☆31Jul 9, 2024Updated last year
- Curated list of free and low cost AI tools, LLM APIs, IDEs, agents, and infrastructure for building real AI apps☆199Apr 21, 2026Updated 2 weeks ago