erans / selfhostllmLinks
A web-based calculator for estimating GPU memory requirements and maximum concurrent requests for self-hosted LLM inference.
☆24Updated last week
Alternatives and similar repositories for selfhostllm
Users that are interested in selfhostllm are comparing it to the libraries listed below
Sorting:
- Offline-first, desktop AI assistant tailored for educators, enabling them to generate questions directly from source materials.☆23Updated 4 months ago
- AgentX python SDK. Build multi-agent AI workforce.☆43Updated 5 months ago
- A framework for hosting and scaling AI agents.☆38Updated last year
- 🤖 Write and Run AI Agents with Markdown. Run automated AI agents with ease.☆124Updated last month
- A Python library to orchestrate LLMs in a neural network-inspired structure☆51Updated last year
- a flexible and customizable React chat component for integrating Parlant's chatbot seamlessly into your website.☆117Updated 2 weeks ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Updated last month
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 11 months ago
- AI-augmented, conversational information retrieval and data exploration☆37Updated last year
- ☆24Updated 10 months ago
- Boost Your Productivity with Nyro☆110Updated last year
- Chat strategies for LLMs☆125Updated this week
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, …☆48Updated last month
- Personal project, Generative AI, Streamlit, Python☆54Updated 7 months ago
- The Ultimate Open-Source RAG-as-a-Service Platform ☕☆51Updated 6 months ago
- Open-source document chat platform with semantic search, RAG (Retrieval Augmented Generation), and multi-provider AI support (OpenRouter,…☆35Updated last week
- ☆57Updated 3 months ago
- Low-code/No-code development platform☆12Updated last year
- an auto coder which automatically fixes errors and improves the code from simple user prompt☆36Updated 11 months ago
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆128Updated 10 months ago
- We handle what engineers and IDEs won't: generating and maintaining technical documentation for your codebase, while also providing searc…☆186Updated 2 months ago
- Ready-to-use agent that can interact directly with any tool or native endpoint, in less than 5 lines of code☆42Updated 2 months ago
- Production-grade AI evaluation, prompt management & observability SDK. Automated evaluations with sub-100ms guardrails. No human-in-the-l…☆36Updated last month
- A background agent system for automating end-to-end software engineering tasks, direct from your github repos.☆189Updated 5 months ago
- Make your LLM agent and chat with it simple and fast!☆67Updated 3 weeks ago
- Low code framework to build and launch a crew of AI agents with shared state. Built with https://axllm.dev.☆39Updated last month
- 👷♂️Minion is Agent's Brain. Minion is designed to execute any type of queries, offering a variety of features that demonstrate its flex…☆49Updated last week
- Deep research agents using MiniMax-M2 interleaved thinking☆143Updated 3 weeks ago
- A curated list of awesome resources for vibe coding☆25Updated 2 weeks ago
- A tool to OCR PDFs using gen-AI models☆45Updated this week