aneeshjoy / vllm-windowsLinks
Docker compose to run vLLM on Windows
☆103Updated last year
Alternatives and similar repositories for vllm-windows
Users that are interested in vllm-windows are comparing it to the libraries listed below
Sorting:
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- automatically quant GGUF models☆204Updated last week
- ☆122Updated 11 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆240Updated 8 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆259Updated 7 months ago
- Service for testing out the new Qwen2.5 omni model☆60Updated 5 months ago
- ☆102Updated last month
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated 11 months ago
- ☆207Updated last month
- ☆51Updated 7 months ago
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆157Updated 5 months ago
- ☆132Updated 5 months ago
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT☆201Updated 3 months ago
- A fast batching API to serve LLM models☆187Updated last year
- A python package for developing AI applications with local LLMs.☆150Updated 9 months ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆137Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels)☆175Updated last week
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated last year
- A multimodal, function calling powered LLM webui.☆216Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆100Updated last month
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- API server for Instant voice cloning by MyShell.☆103Updated last year
- Inference service for Qwen2.5-VL-7b model☆200Updated 6 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆248Updated 5 months ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web☆270Updated this week
- A pipeline parallel training script for LLMs.☆158Updated 5 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆28Updated 6 months ago
- A programming framework for agentic AI 🤖☆23Updated 10 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆120Updated last month
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.☆67Updated 5 months ago