🤖 Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAG (Deprecated check out Orchestra) ->
☆33Jun 10, 2025Updated 9 months ago
Alternatives and similar repositories for llm-server
Users that are interested in llm-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RAG Chatbot powered by Groq LPU, Ollama and Langchain☆13Mar 5, 2024Updated 2 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- My Gen AI research☆11Jun 3, 2024Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆18Dec 22, 2025Updated 3 months ago
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆47Jun 9, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Building AI Research Assistant: Multi-Agent RAG System Reading From Multiple Unstructured Sources☆23Jul 15, 2024Updated last year
- LLM code editor for backend services☆16Oct 19, 2024Updated last year
- A reusable leave-behind for enterprise customers showing the differentiator of using the best Vector Store in the world: Astra DB☆21Feb 2, 2024Updated 2 years ago
- Retrieval augmented generation demos with open-source DeepSeek, Llama, Qwen, Mistral, Gemma☆42Aug 18, 2025Updated 7 months ago
- Merlin SDK Provides A Unified API To Interact With 20+ LLM Models.☆42Jun 18, 2024Updated last year
- A simple plugin boilerplate to create your own OceanWP extension.☆12Jan 25, 2017Updated 9 years ago
- A macOS version of the oobabooga gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA…☆24Mar 7, 2026Updated 2 weeks ago
- Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3☆25May 22, 2024Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 4 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Agentis is an application interface for your local AI models with Ollama allowing you to speak with text and voice with your LLM.☆15Jan 23, 2024Updated 2 years ago
- Various bits and pieces - mostly related to articles on my website.☆11May 3, 2020Updated 5 years ago
- A web interface for SleekDB written in PHP☆11Jan 22, 2022Updated 4 years ago
- Build an LLM powered Ask the Data App with LangChain (using the Pandas DataFrame Agent) and Streamlit☆28Nov 14, 2023Updated 2 years ago
- Iterative specification refinement tool: feeds your docs through GPT Pro Extended Reasoning via Oracle for multiple revision rounds until…☆53Updated this week
- Master the art of building and enhancing AI agents. Learn to develop flow-based applications, implement agentic search, and incorporate h…☆63Jun 20, 2024Updated last year
- ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in c…☆175Oct 16, 2025Updated 5 months ago
- A simple website to manage your Hyper-V VMs and IIS sites☆12Jan 19, 2023Updated 3 years ago
- 粤语双拼输入法 Input method for typing Chinese using Cantonese pronunciations with 2-3 keys per character, based on RIME☆11Jul 25, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- An Android dictionary application with support for mdx format.☆11Jan 7, 2023Updated 3 years ago
- SEO Suggestions from GPT using Keyword Ranking based on search volume, CPC and paid competition.☆11Jun 28, 2023Updated 2 years ago
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Jan 14, 2024Updated 2 years ago
- Implementation of Corrective RAG using LangChain and LangGraph.☆28Mar 14, 2025Updated last year
- Node based Notion MCP server☆11May 26, 2025Updated 10 months ago
- MCP server for Liveblocks.☆15Feb 14, 2026Updated last month
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- A vllm proxy server to add security and multi model management for vllm servers☆12May 30, 2024Updated last year
- API GPT4 Free☆14Jan 31, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 早期的计算机使用7位的ASCII编码,为了处理汉字,程序员设计 了用于简体中文的GB2312和用于繁体中文的big5。 GB2312(1980年)一共收录了7445个字符,包括6763个汉字和682个其它符号。汉字区的内码范围高字节从B0-F7,低字节从A1-FE,占用的码…☆10Sep 10, 2017Updated 8 years ago
- Open Data and sources for OSINT in Tajikistan☆13Jan 17, 2025Updated last year
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal…☆20Dec 29, 2024Updated last year
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆13Apr 27, 2024Updated last year
- stream-of-consciousness experience of an AI's thinking process, complete with creative tangents and unexpected connections.☆14Jan 29, 2025Updated last year
- MCP server for searching and surfacing Claude Code conversation history☆65Feb 27, 2026Updated 3 weeks ago
- Example of gatsby site pulling data from Contenta CMS☆14Sep 28, 2017Updated 8 years ago