micytao / vllm-playgroundLinks
A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and enterprise deployment on OpenShift/Kubernetes.
☆331Updated this week
Alternatives and similar repositories for vllm-playground
Users that are interested in vllm-playground are comparing it to the libraries listed below
Sorting:
- A command-line interface tool for serving LLM using vLLM.☆461Updated last month
- Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)☆393Updated last month
- Community maintained hardware plugin for vLLM on Apple Silicon☆260Updated this week
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang☆96Updated this week
- Common recipes to run vLLM☆335Updated last week
- Self-host LLMs with vLLM and BentoML☆163Updated last month
- The LLM abstraction layer for modern AI agent applications.☆496Updated this week
- ☆236Updated last month
- ☆431Updated last month
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆799Updated this week
- Completed research on semantic retrieval augmented generation through novel semantic similarity graph traversal algorithms.☆265Updated 2 months ago
- Library for model distillation☆160Updated 4 months ago
- REFRAG-style RAG (compress → sense/select → expand) — Single-file reference implementation☆197Updated 3 weeks ago
- Codebase for FinePDFs☆161Updated last week
- Benchmark and optimize LLM inference across frameworks with ease☆155Updated 4 months ago
- 🧍♂️LLM as a manager for approval processes.☆211Updated 9 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆459Updated 4 months ago
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆573Updated 3 weeks ago
- Standardized environment infrastructure for Agentic AI development.☆240Updated this week
- ☆916Updated this week
- Salesforce Enterprise Deep Research☆1,035Updated this week
- ☆263Updated 2 months ago
- Bringing the Unsloth experience to Mac users via Apple's MLX framework☆365Updated last week
- Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.☆399Updated 2 months ago
- A list of AI memory projects☆605Updated last year
- Route LLM requests to the best model for the task at hand.☆161Updated last week
- Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX☆171Updated 4 months ago
- "AnyTool: Universal Tool-Use Layer for AI Agents"☆466Updated this week
- ☆195Updated 5 months ago
- ☆81Updated 4 months ago