🤖 Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAG (Deprecated, check out Orchestra)
★33 · Jun 10, 2025 · Updated 10 months ago
Alternatives and similar repositories for llm-server
Users that are interested in llm-server are comparing it to the libraries listed below.
- RAG Chatbot powered by Groq LPU, Ollama and Langchain ★13 · Mar 5, 2024 · Updated 2 years ago
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen. ★31 · Jul 19, 2025 · Updated 8 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ★18 · Oct 13, 2025 · Updated 6 months ago
- A lightweight Python API wrapper and CLI for Groq's offering of language models using their ultra fast LPU Inference Engine. ★25 · Sep 12, 2024 · Updated last year
- A CLI tool you can pipe code and then ask for changes, add documentation, etc, using the OpenAI API. ★13 · Jan 5, 2024 · Updated 2 years ago
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot ★47 · Jun 9, 2025 · Updated 10 months ago
- Building AI Research Assistant: Multi-Agent RAG System Reading From Multiple Unstructured Sources ★23 · Jul 15, 2024 · Updated last year
- Automatically generate tests for your website by using LLM models ★17 · Aug 7, 2023 · Updated 2 years ago
- DSPY Experiments ★15 · May 2, 2024 · Updated last year
- A reusable leave-behind for enterprise customers showing the differentiator of using the best Vector Store in the world: Astra DB ★21 · Feb 2, 2024 · Updated 2 years ago
- Retrieval augmented generation demos with open-source DeepSeek, Llama, Qwen, Mistral, Gemma ★41 · Aug 18, 2025 · Updated 7 months ago
- Merlin SDK Provides A Unified API To Interact With 20+ LLM Models. ★42 · Jun 18, 2024 · Updated last year
- A macOS version of the oobabooga gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA… ★24 · Mar 7, 2026 · Updated last month
- Backend of LeafGPT, a clone of the original ChatGPT website ★17 · Nov 30, 2023 · Updated 2 years ago
- My implementation of autogen and memgpt agents that work together to create simple scripts and help plan out larger projects. ★30 · Jan 28, 2024 · Updated 2 years ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ★70 · Nov 17, 2025 · Updated 4 months ago
- Agentis is an application interface for your local AI models with Ollama allowing you to speak with text and voice with your LLM. ★15 · Jan 23, 2024 · Updated 2 years ago
- LLM Chat is an open-source serverless alternative to ChatGPT. ★36 · Sep 13, 2024 · Updated last year
- A web interface for SleekDB written in PHP ★11 · Jan 22, 2022 · Updated 4 years ago
- Build an LLM powered Ask the Data App with LangChain (using the Pandas DataFrame Agent) and Streamlit ★28 · Nov 14, 2023 · Updated 2 years ago
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial) ★11 · Jul 19, 2024 · Updated last year
- Iterative specification refinement tool: feeds your docs through GPT Pro Extended Reasoning via Oracle for multiple revision rounds until… ★57 · Mar 22, 2026 · Updated 3 weeks ago
- An IDE for AI coding ★27 · Updated this week
- ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in c… ★176 · Oct 16, 2025 · Updated 5 months ago
- A simple website to manage your Hyper-V VMs and IIS sites ★12 · Jan 19, 2023 · Updated 3 years ago
- ★14 · Sep 18, 2024 · Updated last year
- ★41 · Feb 5, 2026 · Updated 2 months ago
- SEO Suggestions from GPT using Keyword Ranking based on search volume, CPC and paid competition. ★11 · Jun 28, 2023 · Updated 2 years ago
- Simple front-end interface for querying a local Ollama API server ★24 · Dec 1, 2023 · Updated 2 years ago
- Work-in-progress on converting Pywal colors to a matching Chrome theme ★11 · Dec 15, 2019 · Updated 6 years ago
- An AI toolkit for performing complex internet lookups, crawls and summaries. It utilizes and constantly expands its knowledge stores wit… ★20 · Jul 27, 2024 · Updated last year
- Implementation of Corrective RAG using LangChain and LangGraph. ★27 · Mar 14, 2025 · Updated last year
- Node based Notion MCP server ★11 · May 26, 2025 · Updated 10 months ago
- A vllm proxy server to add security and multi model management for vllm servers ★12 · May 30, 2024 · Updated last year
- API GPT4 Free ★14 · Jan 31, 2024 · Updated 2 years ago
- Custom scripts for i3blocks written in bash. ★17 · Apr 22, 2020 · Updated 5 years ago
- Collection of Rowy's templates for cloud functions code snippets - including for derivative, action columns and extensions. ★14 · Sep 8, 2023 · Updated 2 years ago
- Stream-of-consciousness experience of an AI's thinking process, complete with creative tangents and unexpected connections. ★14 · Jan 29, 2025 · Updated last year
- A framework for creating voice based agents. Integrates LLMs with speech recognition and text-to-speech ★34 · May 1, 2024 · Updated last year