Docker compose to run vLLM on Windows
☆119Jan 1, 2024Updated 2 years ago
Alternatives and similar repositories for vllm-windows
Users that are interested in vllm-windows are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…☆17Mar 18, 2025Updated last year
- Talk to your data. Instantly analyze, visualize, and transform☆20Oct 30, 2025Updated 6 months ago
- A full-stack document management and AI chat application that enables users to upload, manage, and chat with their documents using AI. Bu…☆16Aug 10, 2025Updated 9 months ago
- Documentation and helper scripts for Gigabyte Aero 15x v8 workarounds☆18Oct 30, 2018Updated 7 years ago
- An MCP server that provides AI assistants with screenshot capabilities — both web page capture via Puppeteer and cross-platform system sc…☆24Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A fast batching API to serve LLM models☆189Apr 26, 2024Updated 2 years ago
- Offline tool that processes YouTube videos using WhisperX for automatic transcription and speaker diarization, detects logical fallacies,…☆29Aug 14, 2024Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- GoalChain for goal-orientated LLM conversation flows☆69Dec 2, 2024Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui☆34Jul 17, 2024Updated last year
- Playing with CSM☆22Mar 14, 2025Updated last year
- The task aims at extracting required fields in receipts captured by mobile devices☆34Nov 4, 2022Updated 3 years ago
- This is an LLM interface that you can use to analyze and get insight into diary entries or other documents completely offline.☆16Dec 31, 2023Updated 2 years ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆58Aug 21, 2025Updated 9 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆24Jun 1, 2024Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated 4 months ago
- ☆17Feb 13, 2021Updated 5 years ago
- ☆16Dec 16, 2024Updated last year
- How to install macOS Big Sur on an Unsupported Mac : Example: MacBook Pro Late 2011☆11Feb 27, 2021Updated 5 years ago
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆13Dec 9, 2023Updated 2 years ago
- ☆12Sep 22, 2024Updated last year
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TLS & API keys for your LLM APIs☆20Dec 17, 2025Updated 5 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆21Oct 28, 2025Updated 7 months ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆25Sep 1, 2025Updated 8 months ago
- Ace-Step Dataset Generator☆26Sep 27, 2025Updated 8 months ago
- Open source RDP client for Android OS☆14May 17, 2012Updated 14 years ago
- Chatbot for the good vibes.☆11Aug 29, 2024Updated last year
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Feb 12, 2025Updated last year
- Example using OpenTelemetry to instrument a FastAPI / LangGraph / Langchain application☆12Nov 12, 2024Updated last year
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆128Oct 22, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated last year
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated last year
- 3x Faster Inference; Unofficial implementation of EAGLE Speculative Decoding☆84Jul 3, 2025Updated 10 months ago
- Llama cute voice assistant☆28Sep 10, 2023Updated 2 years ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆45Jan 27, 2026Updated 4 months ago
- Ubuntu용 한글뷰어☆12Jan 3, 2018Updated 8 years ago
- Unofficial Pytorch Implementation of "A Simple Framework for Contrastive Learning of Visual Representations"☆10Mar 11, 2020Updated 6 years ago