aneeshjoy / vllm-windows
Docker compose to run vLLM on Windows
☆64 Updated last year
Alternatives and similar repositories for vllm-windows:
Users who are interested in vllm-windows are comparing it to the libraries listed below.
- automatically quant GGUF models ☆160 Updated this week
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a… ☆111 Updated 8 months ago
- ☆80 Updated 2 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆86 Updated 6 months ago
- A fast batching API to serve LLM models ☆181 Updated 10 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 Updated 6 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no… ☆114 Updated 4 months ago
- ☆111 Updated 2 months ago
- Open Source Text Embedding Models with OpenAI Compatible API ☆147 Updated 8 months ago
- ☆28 Updated 5 months ago
- A python package for developing AI applications with local LLMs. ☆145 Updated 2 months ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆171 Updated 10 months ago
- ☆46 Updated 3 weeks ago
- ☆124 Updated this week
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs). ☆98 Updated 4 months ago
- ☆152 Updated 7 months ago
- Automated LLM novelist ☆42 Updated 11 months ago
- A multimodal, function calling powered LLM webui. ☆215 Updated 5 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆53 Updated 4 months ago
- Easily view and modify JSON datasets for large language models ☆71 Updated last week
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆53 Updated 2 weeks ago
- ☆28 Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface ☆91 Updated 8 months ago
- ☆53 Updated 9 months ago
- All the world is a play, we are but actors in it. ☆47 Updated this week
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models … ☆31 Updated 8 months ago
- A pipeline parallel training script for LLMs. ☆128 Updated this week
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆231 Updated last week
- Ollama chat client in Vue, everything you need to do your private text rpg in browser ☆120 Updated 4 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com. ☆115 Updated 9 months ago