Chen-zexi / vllm-cliLinks
A command-line interface tool for serving LLM using vLLM.
☆464Updated this week
Alternatives and similar repositories for vllm-cli
Users that are interested in vllm-cli are comparing it to the libraries listed below
Sorting:
- A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes…☆340Updated this week
- The LLM abstraction layer for modern AI agent applications.☆496Updated 2 weeks ago
- ☆264Updated 2 months ago
- ☆439Updated last month
- Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere☆1,088Updated this week
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆225Updated 3 months ago
- ☆303Updated 5 months ago
- Verify Precision of all Kimi K2 API Vendor☆498Updated last week
- Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)☆393Updated last month
- Train Large Language Models on MLX.☆241Updated last week
- A list of AI memory projects☆615Updated last year
- Bringing the Unsloth experience to Mac users via Apple's MLX framework☆404Updated last week
- Evolve your language agent with Agentic Context Engineering (ACE)☆529Updated 2 months ago
- Community maintained hardware plugin for vLLM on Apple Silicon☆313Updated this week
- Library for model distillation☆160Updated 4 months ago
- ☆975Updated 2 weeks ago
- ☆173Updated 5 months ago
- Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX☆172Updated 4 months ago
- ☆194Updated 6 months ago
- Model Activity Visualiser☆520Updated 9 months ago
- ☆237Updated 2 months ago
- Completed research on semantic retrieval augmented generation through novel semantic similarity graph traversal algorithms.☆266Updated 2 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆221Updated 5 months ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆599Updated this week
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆565Updated 2 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆458Updated 5 months ago
- "AnyTool: Universal Tool-Use Layer for AI Agents"☆477Updated 2 weeks ago
- An Automatic Prompt Optimization Framework for Large Language Models☆878Updated 5 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba…☆318Updated last year
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆280Updated 3 weeks ago