aneeshjoy / vllm-windowsView external linksLinks
Docker compose to run vLLM on Windows
☆114Jan 1, 2024Updated 2 years ago
Alternatives and similar repositories for vllm-windows
Users that are interested in vllm-windows are comparing it to the libraries listed below
Sorting:
- A full-stack document management and AI chat application that enables users to upload, manage, and chat with their documents using AI. Bu…☆17Aug 10, 2025Updated 6 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- ESPNet TTS with Streamlit GUI☆13Apr 30, 2023Updated 2 years ago
- Documentation and helper scripts for Gigabyte Aero 15x v8 workarounds☆17Oct 30, 2018Updated 7 years ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆51Aug 21, 2025Updated 5 months ago
- XTTSv2 Extension for oobabooga text-generation-webui☆34Jul 17, 2024Updated last year
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated last month
- GoalChain for goal-orientated LLM conversation flows☆71Dec 2, 2024Updated last year
- LLM FX: A LLM Server Desktop Client free for everyone!☆33Dec 19, 2025Updated last month
- This is an LLM interface that you can use to analyze and get insight into diary entries or other documents completely offline.☆16Dec 31, 2023Updated 2 years ago
- ☆28Apr 22, 2024Updated last year
- TLS & API keys for your LLM APIs☆19Dec 17, 2025Updated last month
- Playing with CSM☆22Mar 14, 2025Updated 10 months ago
- ☆17Dec 16, 2024Updated last year
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆23Sep 1, 2025Updated 5 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 8 months ago
- Automated LLM novelist☆46Apr 11, 2024Updated last year
- Generate Structured JSON with probs from Language Models☆17Mar 23, 2025Updated 10 months ago
- ☆128Nov 9, 2024Updated last year
- ☆24Jun 1, 2024Updated last year
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated 8 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated 10 months ago
- An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intellig…☆57Aug 1, 2024Updated last year
- simple terminal-based AI coding agent. This is for learning purposes more than a final working app.☆26Mar 6, 2025Updated 11 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Jun 3, 2024Updated last year
- The full stack Next.js starter project for vibe coding.☆26May 27, 2025Updated 8 months ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆92Feb 5, 2026Updated last week
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- Llama cute voice assistant☆27Sep 10, 2023Updated 2 years ago
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Oct 21, 2025Updated 3 months ago
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆33Feb 12, 2025Updated last year
- Orpheus Chat WebUI☆76Mar 27, 2025Updated 10 months ago
- Crow is a Desktop AI Assistant☆32Aug 9, 2024Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Mar 2, 2024Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Mar 15, 2025Updated 10 months ago
- Semantic-Fleet serves as a specialized extension hub for the Semantic-Kernel ecosystem. It houses a diverse array of connectors designed …☆31Oct 27, 2025Updated 3 months ago
- ☆54May 28, 2025Updated 8 months ago