A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and enterprise deployment on OpenShift/Kubernetes.
☆459Apr 7, 2026Updated last month
Alternatives and similar repositories for vllm-playground
Users that are interested in vllm-playground are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MCP server providing tools to create Ms Office documents like presentations, emails, spreadsheets and word docs (pptx, docx, eml, xlsx)☆25Apr 11, 2026Updated last month
- Quy Nhon AI Hackathon 2022 - Challenge 2: Review Analytics - Top 1 Solution☆10Sep 21, 2022Updated 3 years ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆1,155Updated this week
- A command-line interface tool for serving LLM using vLLM.☆494Jan 25, 2026Updated 3 months ago
- Empowering Data Driven insights through hands-on projects, SQL challenges and practical tools.☆24Mar 7, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 읽어야 하는 논문들을 관리하고, 읽은 논문들의 기록을 남기는 공간☆31Jan 8, 2020Updated 6 years ago
- 🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the p…☆87Feb 9, 2026Updated 3 months ago
- A Web app demonstrating multimodal image search using Visualized-BGE model☆15Dec 1, 2024Updated last year
- A TypeScript Model Context Protocol (MCP) server to allow LLMs to programmatically construct mind maps to explore an idea space, with enf…☆26Mar 23, 2025Updated last year
- The official codebase for "Experiential Reinforcement Learning" - https://arxiv.org/pdf/2602.13949v1☆68May 8, 2026Updated 2 weeks ago
- ☆10Apr 15, 2026Updated last month
- A pipecat bot demo implementation of a Spotify assistant for creating playlists☆19Oct 14, 2025Updated 7 months ago
- A high-performance and light-weight router for vLLM large scale deployment☆229May 6, 2026Updated 2 weeks ago
- Links to recourses for the Lean Theorem Prover☆13Dec 3, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple Jupyter Notebook to graph a users commit history over time, specifically looking at the author of the xz backdoor.☆23Mar 30, 2024Updated 2 years ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆40May 15, 2026Updated last week
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆28Dec 23, 2025Updated 4 months ago
- Supercharge Your LLM with the Fastest KV Cache Layer☆8,282Updated this week
- ☆12Mar 1, 2018Updated 8 years ago
- An impelementation of image search engine using CLIP (Contrastive Language-Image Pre-Training☆15Aug 9, 2024Updated last year
- Deploy your own self-hosted GenAI cluster on Kubernetes using Ollama and OpenWebUI.☆12Feb 16, 2026Updated 3 months ago
- GraphRAG 中文文档。GraphRAG是一种结构化的、分层的检索增强生成(RAG)方法,而不是使用纯文本片段的语义搜索方法。GraphRAG 过程包括从原始文本中提取出知识图谱,构建社群层级(这种结构通常用来描述个体、群体及它们之间的关系,帮助理解信息如何在社群内部传…☆19Jul 12, 2024Updated last year
- A sandbox for showcasing different use cases of LangChain's createAgent☆72Dec 11, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Open Knowledge Graph Resources is a static, daily-refreshed catalog of ontology and semantic software records sourced from Wikidata. It p…☆57Updated this week
- OCI Toolkit for VSCode - Functions, Data Science, Resource Manager☆21Nov 12, 2025Updated 6 months ago
- On-cluster FBC catalog content server☆19Mar 3, 2025Updated last year
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆3,262Updated this week
- A Mechanistic View on Video Generation as World Models: State and Dynamics☆39Updated this week
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆38Jul 2, 2025Updated 10 months ago
- 문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.☆19Jun 16, 2021Updated 4 years ago
- Prevent cloud misconfigurations during build-time for Terraform, Cloudformation, Kubernetes, Serverless framework, and other infrastructu…☆12Jan 13, 2026Updated 4 months ago
- [AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".☆22Mar 18, 2026Updated 2 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆28Jul 29, 2025Updated 9 months ago
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift☆27Aug 27, 2025Updated 8 months ago
- ☆16May 16, 2025Updated last year
- Python, LlamaIndex, LangChain, Docker Compose: 15 Property Graph, 4 RDF , 10 Vector, OpenSearch, Elasticsearch, Alfresco DBs. 13 data sou…☆127Updated this week
- AgentOS is a lightweight, single-file implementation that provides a robust foundation for building autonomous AI agents. It implements t…☆23Jul 11, 2025Updated 10 months ago
- A lightweight Text-to-Image Retrieval model [Web App]☆29Dec 6, 2024Updated last year
- [arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning☆71Dec 17, 2025Updated 5 months ago