Chen-zexi / vllm-cli
A command-line interface tool for serving LLMs using vLLM.
☆433 · Updated last week
Alternatives and similar repositories for vllm-cli
Users who are interested in vllm-cli are comparing it to the repositories listed below.
- ☆246 · Updated 3 weeks ago
- Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings) ☆330 · Updated 3 months ago
- Model Activity Visualiser ☆519 · Updated 6 months ago
- Salesforce Enterprise Deep Research ☆147 · Updated last week
- MiniMax-M2, a Mini model built for Max coding & agentic workflows. ☆656 · Updated this week
- ☆300 · Updated 2 months ago
- An open-source application for building, observing, and collaborating with teams of AI agents. ☆394 · Updated 2 months ago
- ☆573 · Updated 4 months ago
- ☆152 · Updated 3 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆445 · Updated 2 months ago
- Turn topics into essays in seconds! ☆189 · Updated 3 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆192 · Updated 2 months ago
- 🧍‍♂️ LLM as a manager for approval processes. ☆209 · Updated 6 months ago
- The Open Deep Research app – generate reports with OSS LLMs ☆302 · Updated 3 months ago
- Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX ☆163 · Updated last month
- LettuceDetect is a hallucination detection framework for RAG applications. ☆507 · Updated last month
- ☆136 · Updated 2 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba… ☆314 · Updated 10 months ago
- llmbasedos - Local-First OS Where Your AI Agents Wake Up and Work ☆276 · Updated 2 months ago
- ☆77 · Updated 2 months ago
- Library for model distillation ☆153 · Updated last month
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support… ☆641 · Updated last week
- Local debugging agent that runs in your terminal ☆392 · Updated 4 months ago
- ☆248 · Updated 3 months ago
- ☆1,986 · Updated last week
- ☆232 · Updated 3 months ago
- ☆445 · Updated this week
- Agentic testing for agentic codebases ☆621 · Updated last week
- An Automatic Prompt Optimization Framework for Large Language Models ☆130 · Updated 2 months ago
- All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container. ☆713 · Updated this week