ruska-ai / llm-serverView external linksLinks
🤖 Open-source LLM server (OpenAI, Ollama, Groq, Anthropic) with support for HTTP, Streaming, Agents, RAG (Deprecated check out Orchestra) ->
☆32Jun 10, 2025Updated 8 months ago
Alternatives and similar repositories for llm-server
Users that are interested in llm-server are comparing it to the libraries listed below
Sorting:
- RAG Chatbot powered by Groq LPU, Ollama and Langchain☆13Mar 5, 2024Updated last year
- Integrated LLM-based document and data Q&A with knowledge graph visualization☆23Dec 9, 2023Updated 2 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- Agentis is an application interface for your local AI models with Ollama allowing you to speak with text and voice with your LLM.☆15Jan 23, 2024Updated 2 years ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 4 months ago
- A lightweight Python API wrapper and CLI for Groq’s offering of language models using their ultra fast LPU Inference Engine.☆23Sep 12, 2024Updated last year
- Retrieval augmented generation demos with open-source DeepSeek, Llama, Qwen, Mistral, Gemma☆42Aug 18, 2025Updated 5 months ago
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆46Jun 9, 2025Updated 8 months ago
- Master the art of building and enhancing AI agents. Learn to develop flow-based applications, implement agentic search, and incorporate h…☆62Jun 20, 2024Updated last year
- An open-source framework for building monolithic or distributed agentic systems, ranging from simple LLM calls to compositional workflows…☆25Jan 14, 2026Updated last month
- Chat AI (↓↓Scroll to see more↓↓)☆27Jul 24, 2024Updated last year
- A simple framework for using a local Koboldcpp LLM to help with story-writing☆24Nov 26, 2023Updated 2 years ago
- Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3☆25May 22, 2024Updated last year
- Autonomous agent networks for task automation that requires multi-step reasoning☆29Sep 1, 2025Updated 5 months ago
- Build an LLM powered Ask the Data App with LangChain (using the Pandas DataFrame Agent) and Streamlit☆28Nov 14, 2023Updated 2 years ago
- ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.☆27Dec 13, 2025Updated 2 months ago
- My implementation of autogen and memgpt agents that work together to create simple scripts and help plan out larger projects.☆30Jan 28, 2024Updated 2 years ago
- ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in c…☆173Oct 16, 2025Updated 3 months ago
- Implementation of Corrective RAG using LangChain and LangGraph.☆28Mar 14, 2025Updated 11 months ago
- Dabarqus is incredibly fast RAG that runs everywhere.☆59Jan 30, 2025Updated last year
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- AI POCS: ML, NLP, LLM, Vision, Classification, clustering, GenAI, Transformers, PyTorch, Keras, All things AI POCS.☆12Updated this week
- LLM Chat is an open-source serverless alternative to ChatGPT.☆36Sep 13, 2024Updated last year
- Deep Research through Multi-Agents, using GraphRAG☆85Aug 21, 2025Updated 5 months ago
- A new novel multi-modality (Vision) RAG architecture☆36Oct 1, 2024Updated last year
- Reusable OpenAI secure UI and infrastructure for AI Chat with Azure☆18Aug 4, 2025Updated 6 months ago
- 这是一次学校大作业,希望和大家分享,一起进步。此项目分驱动部分,遥控部分,视觉部分以及Web控制部分。是基于ESP32与Jetson Nano做的一个小项目。其中运用到了蓝牙串口片与片之间的通信,IP私域下的多机通信,以及ESP32中便携的Web功能进行通信。具体各部分内容…☆12Nov 5, 2024Updated last year
- 🕹 Pikachu-volleyball game-based multi-agent RL environment using PettingZoo☆11Sep 29, 2024Updated last year
- This is a list used to collect the available (open-source / closed-source) projects that comply with Google Agent2Agent.☆13Apr 24, 2025Updated 9 months ago
- 稚晖君电子Esp32脱机版☆11Jan 15, 2025Updated last year
- My portfolio website made with React and Sass☆15Sep 5, 2024Updated last year
- A platform designed to facilitate the development of advanced conversational agents using retrieval augmented generation (RAG).☆34Sep 28, 2025Updated 4 months ago
- Run CrewAI agent workflows on local LLM models with Llamafile and Ollama☆39May 24, 2024Updated last year
- ☆14May 27, 2025Updated 8 months ago
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- yolo目标检测算法☆15Jul 27, 2025Updated 6 months ago
- Emotion based music recommender system☆11Mar 26, 2025Updated 10 months ago
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal…☆19Dec 29, 2024Updated last year
- A minimalistic deployment software focused on simplicity and clarity.☆11Feb 12, 2022Updated 4 years ago