aneeshjoy / vllm-windowsLinks

Docker compose to run vLLM on Windows

☆98

Alternatives and similar repositories for vllm-windows

Users that are interested in vllm-windows are comparing it to the libraries listed below

Sorting:

severian42 / MoA-Ollama-Chat
This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…
☆117Updated last year
leafspark / AutoGGUF
automatically quant GGUF models
☆190Updated this week
chigkim / Ollama-MMLU-Pro
☆95Updated 7 months ago
matatonic / openedai-vision
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
☆260Updated 5 months ago
RandomInternetPreson / Lucid_Autonomy
An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…
☆125Updated 9 months ago
amanvirparhar / weebo
A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.
☆233Updated 6 months ago
kevkid / gguf_gui
☆117Updated 9 months ago
adrienbrault / hf-gguf-to-ollama
Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.
☆116Updated last year
nath1295 / LLMFlex
A python package for developing AI applications with local LLMs.
☆152Updated 7 months ago
Itachi-Uchiha581 / Auto-Data
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
☆102Updated 9 months ago
phildougherty / qwen2.5_omni_chat
Service for testing out the new Qwen2.5 omni model
☆54Updated 3 months ago
remichu-ai / gallama
☆132Updated 3 months ago
CyberTimon / Powerpointer-For-Local-LLMs
Local Powerpointer - A beautiful powerpoint generator which uses the power of local running large language models to generate the powerpo…
☆260Updated last month
rombodawg / Easy_training
☆49Updated 5 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆185Updated last year
severian42 / Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …
☆184Updated last year
SomeOddCodeGuy / OfflineWikipediaTextApi
This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …
☆98Updated 3 weeks ago
curvedinf / novel-writer
Automated LLM novelist
☆47Updated last year
yoziru / nextjs-vllm-ui
Fully-featured, beautiful web interface for vLLM - built with NextJS.
☆149Updated 3 months ago
matteoserva / GraphLLM
☆207Updated 2 weeks ago
phildougherty / qwen2.5-VL-inference-openai
Inference service for Qwen2.5-VL-7b model
☆191Updated 4 months ago
umar-mq / chainlit-rag
☆29Updated 10 months ago
molbal / llm-text-completion-finetune
Guide on text completion large language model fine-tuning, including example scripts and training data acquiring.
☆80Updated 5 months ago
mamei16 / LLM_Web_search
An extension for oobabooga/text-generation-webui that enables the LLM to search the web
☆255Updated this week
Finity-Alpha / OpenVoiceChat
Have a natural voice conversation with an LLM
☆252Updated 8 months ago
Fus3n / TwoAI
A simple experiment on letting two local LLM have a conversation about anything!
☆110Updated last year
RandomInternetPreson / Lucid_Vision
This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…
☆57Updated 9 months ago
Rivridis / LLM-Assistant
Locally running LLM with internet access
☆96Updated last month
Fus3n / gem-assist
Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools
☆110Updated last month
neuml / rag
🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.
☆390Updated 3 months ago