Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, streaming, agents, and RAG (deprecated; check out Orchestra)
☆33 · Jun 10, 2025 · Updated last year
Alternatives and similar repositories for llm-server
Users who are interested in llm-server are comparing it to the libraries listed below.
- RAG Chatbot powered by Groq LPU, Ollama and Langchain ☆13 · Mar 5, 2024 · Updated 2 years ago
- My Gen AI research ☆11 · Jun 3, 2024 · Updated 2 years ago
- A rabbit-fast Rust reimplementation inspired by Claude Code, with native TUI, deeper tooling, and a cleaner path for terminal-first AI … ☆43 · Apr 9, 2026 · Updated 2 months ago
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen. ☆31 · Jul 19, 2025 · Updated 11 months ago
- ☆19 · Mar 17, 2025 · Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆18 · Oct 13, 2025 · Updated 8 months ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more! ☆18 · Dec 22, 2025 · Updated 5 months ago
- Python wrapper and CLI for Groq's LPU inference API. ☆25 · Apr 23, 2026 · Updated last month
- Building AI Research Assistant: Multi-Agent RAG System Reading From Multiple Unstructured Sources ☆23 · Jul 15, 2024 · Updated last year
- Automatically generate tests for your website by using LLM models ☆17 · Aug 7, 2023 · Updated 2 years ago
- LLM code editor for backend services ☆16 · Oct 19, 2024 · Updated last year
- A reusable leave-behind for enterprise customers showing the differentiator of using the best Vector Store in the world: Astra DB ☆21 · Feb 2, 2024 · Updated 2 years ago
- This repository contains a toy implementation of a basic RAQA system. ☆20 · Jun 3, 2024 · Updated 2 years ago
- Retrieval augmented generation demos with open-source DeepSeek, Llama, Qwen, Mistral, Gemma ☆42 · Aug 18, 2025 · Updated 10 months ago
- A simple plugin boilerplate to create your own OceanWP extension. ☆12 · Jan 25, 2017 · Updated 9 years ago
- A macOS version of the oobabooga gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA… ☆25 · May 27, 2026 · Updated 3 weeks ago
- Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3 ☆25 · May 22, 2024 · Updated 2 years ago
- My implementation of autogen and memgpt agents that work together to create simple scripts and help plan out larger projects. ☆30 · Jan 28, 2024 · Updated 2 years ago
- Agentis is an application interface for your local AI models with Ollama allowing you to speak with text and voice with your LLM. ☆17 · Jan 23, 2024 · Updated 2 years ago
- LLM Chat is an open-source serverless alternative to ChatGPT. ☆36 · Sep 13, 2024 · Updated last year
- ☆15 · Sep 13, 2024 · Updated last year
- Must-memorize: 8000 spoken English sentences ☆13 · Jul 21, 2022 · Updated 3 years ago
- A web interface for SleekDB written in PHP ☆11 · Jan 22, 2022 · Updated 4 years ago
- telegram bot for quickly downloading from anna's archive ☆12 · Dec 5, 2022 · Updated 3 years ago
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial) ☆11 · Jul 19, 2024 · Updated last year
- A plugin for hapi.js that generates etags for your responses ☆10 · Jul 22, 2018 · Updated 7 years ago
- Iterative specification refinement tool: feeds your docs through GPT Pro Extended Reasoning via Oracle for multiple revision rounds until… ☆59 · Mar 22, 2026 · Updated 2 months ago
- Master the art of building and enhancing AI agents. Learn to develop flow-based applications, implement agentic search, and incorporate h… ☆74 · Jun 20, 2024 · Updated last year
- Cantonese double-pinyin input method (粤语双拼输入法): input method for typing Chinese using Cantonese pronunciations with 2-3 keys per character, based on RIME ☆11 · Jul 25, 2021 · Updated 4 years ago
- An Android dictionary application with support for mdx format. ☆11 · Jan 7, 2023 · Updated 3 years ago
- An open-source framework for building monolithic or distributed agentic systems, ranging from simple LLM calls to compositional workflows… ☆29 · Jan 14, 2026 · Updated 5 months ago
- Simple front-end interface for querying a local Ollama API server ☆24 · Dec 1, 2023 · Updated 2 years ago
- AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry. ☆17 · Jan 14, 2024 · Updated 2 years ago
- ☆17 · Aug 10, 2023 · Updated 2 years ago
- Work-in-progress on converting Pywal colors to a matching Chrome theme ☆11 · Dec 15, 2019 · Updated 6 years ago
- Automatically add explanations of unfamiliar words in ebooks ☆15 · Feb 9, 2023 · Updated 3 years ago
- An AI toolkit for performing complex internet lookups, crawls and summaries. It utilizes and constantly expands its knowledge stores wit… ☆21 · Jul 27, 2024 · Updated last year
- Implementation of Corrective RAG using LangChain and LangGraph. ☆27 · May 13, 2026 · Updated last month
- Node based Notion MCP server ☆11 · May 26, 2025 · Updated last year