Chen-zexi / vllm-cliLinks
A command-line interface tool for serving LLM using vLLM.
☆403Updated 3 weeks ago
Alternatives and similar repositories for vllm-cli
Users that are interested in vllm-cli are comparing it to the libraries listed below
Sorting:
- Model Activity Visualiser☆520Updated 5 months ago
- ☆295Updated last month
- The Open Deep Research app – generate reports with OSS LLMs☆298Updated 2 months ago
- Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX☆153Updated last week
- RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vec…☆305Updated 2 weeks ago
- 🧍♂️LLM as a manager for approval processes.☆209Updated 5 months ago
- Turn topics into essays in seconds!☆187Updated 2 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆370Updated 3 weeks ago
- Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)☆308Updated 2 months ago
- Train Large Language Models on MLX.☆159Updated last month
- An open-source application for building, observing, and collaborating with teams of AI agents.☆387Updated last month
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba…☆307Updated 8 months ago
- ☆231Updated 2 months ago
- ☆131Updated last month
- Turn local files into a prompt for an LLM☆176Updated 7 months ago
- ☆142Updated last month
- LettuceDetect is a hallucination detection framework for RAG applications.☆487Updated last week
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆272Updated 3 weeks ago
- ☆564Updated 2 months ago
- Make your meetings accessible to AI Agents☆348Updated last week
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆172Updated 3 weeks ago
- ☆182Updated 7 months ago
- Library for model distillation☆150Updated last week
- ☆133Updated 2 months ago
- Official python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware.☆542Updated this week
- Agentic testing for agentic codebases☆590Updated last week
- II-Researcher: a new open-source framework designed to aid building search / research agents☆470Updated last month
- ☆209Updated 7 months ago
- RunAgent simplifies serverless deployment of your AI agents. With a powerful CLI, multi-language SDK support, built-in agent invocation &…☆322Updated last week
- 🧠 Advanced Claude streaming interface with interleaved thinking, dynamic tool discovery, and MCP integration. Watch Claude think through…☆181Updated 3 months ago