aneeshjoy / vllm-windows
Docker Compose setup to run vLLM on Windows
☆113Updated 2 years ago
Alternatives and similar repositories for vllm-windows
Users who are interested in vllm-windows are comparing it to the repositories listed below
- ☆108Updated 4 months ago
- Automatically quantize GGUF models☆219Updated 2 weeks ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆117Updated last year
- ☆127Updated last year
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆246Updated 11 months ago
- Service for testing out the new Qwen2.5 omni model☆61Updated 8 months ago
- Inference service for Qwen2.5-VL-7b model☆208Updated 9 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Updated 10 months ago
- ☆51Updated 10 months ago
- ☆210Updated 4 months ago
- A python package for developing AI applications with local LLMs.☆151Updated last year
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆57Updated last year
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆166Updated 3 weeks ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆118Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆192Updated last year
- A fast batching API to serve LLM models☆189Updated last year
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆135Updated last year
- A multimodal, function calling powered LLM webui.☆217Updated last year
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆351Updated 11 months ago
- An Open WebUI function for a better R1 experience☆78Updated 10 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆103Updated last year
- ☆134Updated last month
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆390Updated 2 weeks ago
- Locally running LLM with internet access☆97Updated 6 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆165Updated last year
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web☆275Updated last month
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- Command-line personal assistant using your favorite proprietary or local models with access to 30+ tools☆112Updated 6 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆101Updated 4 months ago