π€ Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAG (Deprecated check out Orchestra) ->
β33Jun 10, 2025Updated 11 months ago
Alternatives and similar repositories for llm-server
Users that are interested in llm-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RAG Chatbot powered by Groq LPU, Ollama and Langchainβ13Mar 5, 2024Updated 2 years ago
- Integrated LLM-based document and data Q&A with knowledge graph visualizationβ24Dec 9, 2023Updated 2 years ago
- πA rabbit-fast Rust reimplementation inspired by Claude Code, with native TUI, deeper tooling, and a cleaner path for terminal-first AI β¦β43Apr 9, 2026Updated last month
- Minimal RAG (Retrieval Augmented Generation) website with Pinecone, FastAPI, NextJS, MongoDBβ11Jun 30, 2024Updated last year
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen.β31Jul 19, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β19Mar 17, 2025Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, anβ¦β18Oct 13, 2025Updated 7 months ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!β18Dec 22, 2025Updated 5 months ago
- Follow Up Boss API examplesβ11Apr 3, 2023Updated 3 years ago
- Python wrapper and CLI for Groq's LPU inference API.β25Apr 23, 2026Updated last month
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilotβ47Jun 9, 2025Updated 11 months ago
- Model Context Protocol (MCP) server for BatchData.io property and address APIs - Real estate data integration for Claude and other AI assβ¦β30Jul 21, 2025Updated 10 months ago
- Building AI Research Assistant: Multi-Agent RAG System Reading From Multiple Unstructured Sourcesβ23Jul 15, 2024Updated last year
- Automatically generate tests for your website by using LLM modelsβ17Aug 7, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- LLM code editor for backend servicesβ16Oct 19, 2024Updated last year
- A reusable leave-behind for enterprise customers showing the differentiator of using the best Vector Store in the world: Astra DBβ21Feb 2, 2024Updated 2 years ago
- This repository contains a toy implementation of a basic RAQA system.β20Jun 3, 2024Updated last year
- Retrieval augmented generation demos with open-source DeepSeek, Llama, Qwen, Mistral, Gemmaβ42Aug 18, 2025Updated 9 months ago
- A macOS version of the oobabooga gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICAβ¦β24Mar 7, 2026Updated 2 months ago
- Backend of LeafGPT, a clone of the original ChatGPT websiteβ17Nov 30, 2023Updated 2 years ago
- A web based whiteboard using Django channels and JSON Web Tokensβ21Mar 31, 2016Updated 10 years ago
- My implementation of autogen and memgpt agents that work together to create simple scripts and help plan out larger projects.β30Jan 28, 2024Updated 2 years ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β70Nov 17, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Agentis is an application interface for your local AI models with Ollama allowing you to speak with text and voice with your LLM.β16Jan 23, 2024Updated 2 years ago
- LLM Chat is an open-source serverless alternative to ChatGPT.β36Sep 13, 2024Updated last year
- Competitive coding submissions at Leetcode goes hereβ16Aug 22, 2021Updated 4 years ago
- β15Sep 13, 2024Updated last year
- A web interface for SleekDB written in PHPβ11Jan 22, 2022Updated 4 years ago
- Official Documentation for DSPy Libraryβ23Updated this week
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial)β11Jul 19, 2024Updated last year
- Master the art of building and enhancing AI agents. Learn to develop flow-based applications, implement agentic search, and incorporate hβ¦β72Jun 20, 2024Updated last year
- ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in cβ¦β178Oct 16, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A simple website to manage your Hyper-V VMs and IIS sitesβ12Jan 19, 2023Updated 3 years ago
- β40Feb 5, 2026Updated 3 months ago
- An open-source framework for building monolithic or distributed agentic systems, ranging from simple LLM calls to compositional workflowsβ¦β29Jan 14, 2026Updated 4 months ago
- The fabric-mcp-server is an MCP server that integrates Fabric patterns with AI coding agents and assistants, exposing them as tools for Aβ¦β18Jul 28, 2025Updated 10 months ago
- Simple front-end interface for querying a local Ollama API serverβ24Dec 1, 2023Updated 2 years ago
- π€ AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.β17Jan 14, 2024Updated 2 years ago
- An AI toolkit for performing complex internet lookups, crawls and summaries. It utilizes and constantly expands it's knowledge stores witβ¦β21Jul 27, 2024Updated last year