A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and enterprise deployment on OpenShift/Kubernetes.
☆470Apr 7, 2026Updated 2 months ago
Alternatives and similar repositories for vllm-playground
Users that are interested in vllm-playground are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MCP server providing tools to create Ms Office documents like presentations, emails, spreadsheets and word docs (pptx, docx, eml, xlsx)☆29May 26, 2026Updated 3 weeks ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆1,254Updated this week
- A command-line interface tool for serving LLM using vLLM.☆501Jan 25, 2026Updated 4 months ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆105Apr 7, 2026Updated 2 months ago
- This repo hosts links to blogs, documentation and assets referenced by the Security Guide Blog.☆11May 7, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The agent for kube-advisor.io☆14Jan 28, 2025Updated last year
- 🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the p…☆90Feb 9, 2026Updated 4 months ago
- This repository provides a subscription that will populate all the latest OpenShift images into Advanced Cluster Manager☆14Updated this week
- A production-ready iOS automation MCP server built with FastMCP 2.0, featuring clean modular architecture with complete platform segregat…☆31Jul 26, 2025Updated 10 months ago
- A Web app demonstrating multimodal image search using Visualized-BGE model☆15Dec 1, 2024Updated last year
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Mar 4, 2025Updated last year
- A TypeScript Model Context Protocol (MCP) server to allow LLMs to programmatically construct mind maps to explore an idea space, with enf…☆28Mar 23, 2025Updated last year
- Extract structured information from images with the AI SDK☆21Aug 14, 2024Updated last year
- ☆10Apr 15, 2026Updated 2 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A pipecat bot demo implementation of a Spotify assistant for creating playlists☆19Oct 14, 2025Updated 8 months ago
- A high-performance and light-weight router for vLLM large scale deployment☆268May 6, 2026Updated last month
- Links to recourses for the Lean Theorem Prover☆13Dec 3, 2019Updated 6 years ago
- A cross-platform GPU monitor TUI with support for both Apple Silicon and NVIDIA GPUs.☆89Mar 5, 2026Updated 3 months ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆40May 20, 2026Updated 3 weeks ago
- LMCache: Supercharge Your LLM with the Fastest KV Cache Layer☆9,158Updated this week
- An impelementation of image search engine using CLIP (Contrastive Language-Image Pre-Training☆15Aug 9, 2024Updated last year
- Deploy your own self-hosted GenAI cluster on Kubernetes using Ollama and OpenWebUI.☆12Feb 16, 2026Updated 4 months ago
- step by step guide to write custom Kubernetes controller - the hard way☆11Jul 15, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆24Jun 3, 2026Updated 2 weeks ago
- Agentic Swarm AI Agent with persistent long-term memory, multi-provider LLM support, token management, self-learning, and Telegram bot in…☆22Feb 27, 2026Updated 3 months ago
- A sandbox for showcasing different use cases of LangChain's createAgent☆73Dec 11, 2025Updated 6 months ago
- Remote MCP Server built using Cloudflare Workers.☆30Jun 3, 2025Updated last year
- templates, index templates, mappings, kibana configs for elasticsearch☆21Mar 24, 2023Updated 3 years ago
- OCI Toolkit for VSCode - Functions, Data Science, Resource Manager☆21Nov 12, 2025Updated 7 months ago
- Automation to install, configure, scale test OpenShift and onboard new workloads☆17Oct 4, 2024Updated last year
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆38Jul 2, 2025Updated 11 months ago
- Langchain + Openclaw = Langclaw☆72Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Prevent cloud misconfigurations during build-time for Terraform, Cloudformation, Kubernetes, Serverless framework, and other infrastructu…☆12Jan 13, 2026Updated 5 months ago
- ☆28Jul 29, 2025Updated 10 months ago
- ☆17Updated this week
- LLMRouter: An Open-Source Library for LLM Routing☆1,980May 13, 2026Updated last month
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift☆27Aug 27, 2025Updated 9 months ago
- It's Corn (PogChamps #3) Kaggle Competition 1st Place Winning Solution☆10Oct 4, 2022Updated 3 years ago
- ☆17May 16, 2025Updated last year