aneeshjoy / vllm-windowsLinks
Docker compose to run vLLM on Windows
☆98Updated last year
Alternatives and similar repositories for vllm-windows
Users that are interested in vllm-windows are comparing it to the libraries listed below
Sorting:
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆117Updated last year
- automatically quant GGUF models☆190Updated this week
- ☆95Updated 7 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆260Updated 5 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆125Updated 9 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆233Updated 6 months ago
- ☆117Updated 9 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆116Updated last year
- A python package for developing AI applications with local LLMs.☆152Updated 7 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 9 months ago
- Service for testing out the new Qwen2.5 omni model☆54Updated 3 months ago
- ☆132Updated 3 months ago
- Local Powerpointer - A beautiful powerpoint generator which uses the power of local running large language models to generate the powerpo…☆260Updated last month
- ☆49Updated 5 months ago
- A fast batching API to serve LLM models☆185Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆184Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆98Updated 3 weeks ago
- Automated LLM novelist☆47Updated last year
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆149Updated 3 months ago
- ☆207Updated 2 weeks ago
- Inference service for Qwen2.5-VL-7b model☆191Updated 4 months ago
- ☆29Updated 10 months ago
- Guide on text completion large language model fine-tuning, including example scripts and training data acquiring.☆80Updated 5 months ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web☆255Updated this week
- Have a natural voice conversation with an LLM☆252Updated 8 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆110Updated last year
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆57Updated 9 months ago
- Locally running LLM with internet access☆96Updated last month
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools☆110Updated last month
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆390Updated 3 months ago