aneeshjoy / vllm-windowsLinks
Docker compose to run vLLM on Windows
☆99Updated last year
Alternatives and similar repositories for vllm-windows
Users that are interested in vllm-windows are comparing it to the libraries listed below
Sorting:
- automatically quant GGUF models☆200Updated this week
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- ☆121Updated 10 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆260Updated 6 months ago
- A python package for developing AI applications with local LLMs.☆152Updated 8 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆126Updated 10 months ago
- ☆132Updated 4 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 10 months ago
- Service for testing out the new Qwen2.5 omni model☆57Updated 4 months ago
- ☆209Updated last week
- ☆50Updated 7 months ago
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆152Updated 4 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated last year
- ☆99Updated 3 weeks ago
- Automated LLM novelist☆46Updated last year
- A fast batching API to serve LLM models☆187Updated last year
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆240Updated 7 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆57Updated 10 months ago
- Experimental LLM Inference UX to aid in creative writing☆122Updated 9 months ago
- A open webui function for better R1 experience☆79Updated 6 months ago
- private-machine is an AI companion system with emotion, needs and goals simulation based on LIDA cognitive architecture. Many agents for …☆23Updated this week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated last year
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆35Updated last year
- A multimodal, function calling powered LLM webui.☆216Updated 11 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆117Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆100Updated 3 weeks ago
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT☆200Updated 3 months ago
- Link you Ollama models to LM-Studio☆142Updated last year
- Open Source Text Embedding Models with OpenAI Compatible API☆160Updated last year