Docker compose to run vLLM on Windows
☆118Jan 1, 2024Updated 2 years ago
Alternatives and similar repositories for vllm-windows
Users that are interested in vllm-windows are comparing it to the libraries listed below.
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- Talk to your data. Instantly analyze, visualize, and transform☆20Oct 30, 2025Updated 6 months ago
- ESPNet TTS with Streamlit GUI☆14Apr 30, 2023Updated 3 years ago
- A full-stack document management and AI chat application that enables users to upload, manage, and chat with their documents using AI. Bu…☆16Aug 10, 2025Updated 8 months ago
- Documentation and helper scripts for Gigabyte Aero 15x v8 workarounds☆18Oct 30, 2018Updated 7 years ago
- Multilingual extension of the SesameAILabs Conversational Speech Generation Model☆30Mar 26, 2025Updated last year
- A fast batching API to serve LLM models☆189Apr 26, 2024Updated 2 years ago
- ☆32Mar 26, 2025Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui☆34Jul 17, 2024Updated last year
- Playing with CSM☆22Mar 14, 2025Updated last year
- This is an LLM interface that you can use to analyze and get insight into diary entries or other documents completely offline.☆16Dec 31, 2023Updated 2 years ago
- Streamlit chatbot with Llama-2-7B-chat☆30Aug 6, 2023Updated 2 years ago
- ☆29Apr 22, 2024Updated 2 years ago
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated 3 months ago
- RAG AI Agent with Realtime Source Validation (Human in the Loop) - Built with CopilotKit + Pydantic AI☆60Dec 21, 2025Updated 4 months ago
- UDT: UDP-based Data Transfer Protocol☆11Apr 21, 2018Updated 8 years ago
- ☆16Dec 16, 2024Updated last year
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆13Dec 9, 2023Updated 2 years ago
- Recursive Self-Aggregation evals on ARC-AGI☆33Jan 26, 2026Updated 3 months ago
- ☆12Sep 22, 2024Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Mar 2, 2024Updated 2 years ago
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 7 months ago
- Ace-Step Dataset Generator☆24Sep 27, 2025Updated 7 months ago
- TLS & API keys for your LLM APIs☆20Dec 17, 2025Updated 4 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆21Oct 28, 2025Updated 6 months ago
- B-Llama3o: a Llama 3 model with vision and audio understanding, as well as text, audio, and animation data output.☆26Jun 3, 2024Updated last year
- ☆11May 2, 2022Updated 4 years ago
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Feb 12, 2025Updated last year
- Generate Structured JSON with probs from Language Models☆17Mar 23, 2025Updated last year
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆128Oct 22, 2024Updated last year
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated 11 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 11 months ago
- Visualize Action Recognition Models☆11Apr 21, 2017Updated 9 years ago
- Real-time Fallacy Detection using OpenAI whisper and ChatGPT/LLaMA/Mistral☆118Apr 21, 2026Updated 2 weeks ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆48Jul 18, 2025Updated 9 months ago
- Llama cute voice assistant☆28Sep 10, 2023Updated 2 years ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆45Jan 27, 2026Updated 3 months ago
- Automated collection, translation and analysis of open source intelligence using large language models.☆21Feb 2, 2024Updated 2 years ago