aneeshjoy / vllm-windowsLinks
Docker compose to run vLLM on Windows
☆92Updated last year
Alternatives and similar repositories for vllm-windows
Users that are interested in vllm-windows are comparing it to the libraries listed below
Sorting:
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆117Updated last year
- automatically quant GGUF models☆187Updated this week
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆257Updated 4 months ago
- ☆95Updated 6 months ago
- A python package for developing AI applications with local LLMs.☆150Updated 6 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆125Updated 8 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆348Updated 5 months ago
- Service for testing out the new Qwen2.5 omni model☆54Updated 2 months ago
- A fast batching API to serve LLM models☆183Updated last year
- ☆204Updated last month
- Inference service for Qwen2.5-VL-7b model☆188Updated 3 months ago
- ☆116Updated 8 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 8 months ago
- ☆131Updated 2 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated 11 months ago
- A multimodal, function calling powered LLM webui.☆214Updated 9 months ago
- ☆49Updated 4 months ago
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆146Updated 2 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆232Updated 5 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆26Updated 4 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆98Updated this week
- Efficient visual programming for AI language models☆364Updated 2 months ago
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging…☆181Updated 9 months ago
- Own your AI, search the web with it🌐😎☆86Updated 6 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆110Updated last year
- ☆29Updated 9 months ago
- a Repository of Open-WebUI tools to use with your favourite LLMs☆247Updated this week
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆116Updated last year
- High level tool use for LLMs☆34Updated 11 months ago
- Local Powerpointer - A beautiful powerpoint generator which uses the power of local running large language models to generate the powerpo…☆257Updated last month