Docker compose to run vLLM on Windows
☆117Jan 1, 2024Updated 2 years ago
Alternatives and similar repositories for vllm-windows
Users that are interested in vllm-windows are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 内容审核及速率限制服务☆26May 18, 2025Updated 10 months ago
- Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…☆18Mar 18, 2025Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 6 months ago
- A full-stack document management and AI chat application that enables users to upload, manage, and chat with their documents using AI. Bu…☆16Aug 10, 2025Updated 8 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Multilingual extension of the SesameAILabs Conversational Speech Generation Model☆31Mar 26, 2025Updated last year
- A fast batching API to serve LLM models☆189Apr 26, 2024Updated last year
- ☆32Mar 26, 2025Updated last year
- Categorize credit card transactions using a local large language model similar to GPT3☆15Dec 29, 2023Updated 2 years ago
- Unofficial API Wrapper for Deepseek (chat.deepseek.com)☆75Aug 5, 2025Updated 8 months ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui☆34Jul 17, 2024Updated last year
- Playing with CSM☆22Mar 14, 2025Updated last year
- OpenAI GPT model to build your personal assistant in IoT devices. Just like Alexa, Google Assistant, Siri, etc. but with your own skills,…☆12Aug 7, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆57Aug 21, 2025Updated 7 months ago
- ☆24Jun 1, 2024Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- RAG AI Agent with Realtime Source Validation (Human in the Loop) - Built with CopilotKit + Pydantic AI☆57Dec 21, 2025Updated 3 months ago
- ☆10Nov 16, 2024Updated last year
- ☆17Dec 16, 2024Updated last year
- Recursive Self-Aggregation evals on ARC-AGI☆29Jan 26, 2026Updated 2 months ago
- ☆12Sep 22, 2024Updated last year
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 6 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Jun 3, 2024Updated last year
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆25Sep 1, 2025Updated 7 months ago
- ☆11May 2, 2022Updated 3 years ago
- Chatbot for the good vibes.☆11Aug 29, 2024Updated last year
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Feb 12, 2025Updated last year
- Ollama Client – Chat with Local LLMs Inside Your Browser A lightweight, privacy‑first Chrome extension to chat with local LLMs via Ollam…☆33Apr 6, 2026Updated last week
- Generate Structured JSON with probs from Language Models☆17Mar 23, 2025Updated last year
- Llama cute voice assistant☆27Sep 10, 2023Updated 2 years ago
- This repository provides FlashPortrait custom nodes for ComfyUI.☆26Dec 29, 2025Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆47Jul 18, 2025Updated 8 months ago
- Simple node proxy for llama-server that enables MCP use☆19May 10, 2025Updated 11 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 10 months ago
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated 10 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆45Jan 27, 2026Updated 2 months ago
- KITE (Knowledge-Intensive Task Evaluation) is an end-to-end benchmark for RAG pipelines☆23Aug 14, 2024Updated last year
- Automated LLM novelist☆47Apr 11, 2024Updated 2 years ago