SystemPanic / vllm-windows
A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels)
☆18Updated last week
Alternatives and similar repositories for vllm-windows:
Users who are interested in vllm-windows are comparing it to the libraries listed below.
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆50Updated 9 months ago
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 8 months ago
- A simple framework for using a local Koboldcpp LLM to help with story-writing☆21Updated last year
- Loader extension for tabbyAPI in SillyTavern☆25Updated 8 months ago
- Testbed for the fastest SD pipelines☆35Updated last year
- An extension to use Kokoro TTS in text generation webui☆14Updated last month
- Interact with an AI game engine that keeps building its rules and world as you play, adapted to your gameplay.☆42Updated 9 months ago
- ☆18Updated 6 months ago
- Attend - to what matters.☆14Updated last month
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated last year
- ☆32Updated 8 months ago
- ExLlamaV2 nodes for ComfyUI.☆116Updated 3 months ago
- 8-bit CUDA functions for PyTorch☆25Updated last year
- DeepFloyd IF web UI☆29Updated last year
- Anything Model Batch Downloader lets you easily batch download models from Civitai and Hugging Face using just the model URL.☆15Updated 2 years ago
- LCM test nodes for comfyui☆62Updated last year
- Generates control vectors for use with llama.cpp in GGUF format.☆19Updated last week
- ☆18Updated last year
- ☆46Updated 4 months ago
- ☆12Updated last year
- Image Generation API Server - Similar to https://text-generator.io but for images☆50Updated 3 months ago
- Model code for inferencing T5☆62Updated 3 weeks ago
- Wan2.1, quantized and optimized so it fits on your 3090/4090☆30Updated last month
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 4 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆29Updated this week
- ComfyUI node for fast neural style transfer☆71Updated 7 months ago
- ☆46Updated 4 months ago
- SDXL conditioning sizing Node for ComfyUI☆28Updated 10 months ago
- Embedding-inspector extension for AUTOMATIC1111/stable-diffusion-webui☆21Updated last year
- Collection of scripts, patches, and custom nodes for ComfyUI☆25Updated 6 months ago