A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels)
☆323Mar 2, 2026Updated this week
Alternatives and similar repositories for vllm-windows
Users that are interested in vllm-windows are comparing it to the libraries listed below
Sorting:
- Advanced CLI diffusion inference/training suite based on Musubi Tuner☆40Updated this week
- Deepspeed windows information☆44Mar 9, 2024Updated last year
- ☆10Jul 20, 2024Updated last year
- An Extension for Automatic1111 Webui that makes the interface easier to use on mobile (portrait)☆16Apr 16, 2024Updated last year
- ☆32Jul 20, 2024Updated last year
- a neural network trainer for weebs☆14Feb 23, 2026Updated last week
- ☆24Updated this week
- Fine-tuning code for CLIP models☆268Jan 28, 2026Updated last month
- A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones☆37Feb 5, 2025Updated last year
- A Multipurpose toolkit for managing, editing and creating models.☆12Aug 13, 2024Updated last year
- ☆11Aug 19, 2025Updated 6 months ago
- Money tracking - Android App for planning, tracking your spending, monitoring your credit and budget - UNIBO 2016/2017☆12Sep 14, 2017Updated 8 years ago
- ☆18Nov 28, 2025Updated 3 months ago
- flux1非官方的量化模型(flux1 unofficial quantize model)☆12Aug 14, 2024Updated last year
- Fork of the Triton language and compiler for Windows support and easy installation☆1,869Feb 18, 2026Updated 2 weeks ago
- Controlnet module for Wan2.1☆30Aug 4, 2025Updated 7 months ago
- combine source code files into single prompt to chat with your repository☆14May 15, 2024Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- stable-diffusion-webui-images-browser☆14Jan 12, 2023Updated 3 years ago
- ICDE 2025 Paper, Grounding Natural Language to SQL Translation with Data-Based Self-Explanations☆17May 24, 2025Updated 9 months ago
- pre-training llama3 using chinese☆13May 1, 2024Updated last year
- Easily create LLM automation/agent workflows☆60Feb 13, 2024Updated 2 years ago
- Fork of ACE-Step v1.0 for LoRA training with < 10 GB VRAM☆66Feb 3, 2026Updated last month
- ☆18Dec 2, 2024Updated last year
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 9 months ago
- Docker compose to run vLLM on Windows☆116Jan 1, 2024Updated 2 years ago
- A comprehensive codebase for training and finetuning Image <> Latent models.☆50Mar 1, 2025Updated last year
- Extension/Script for Stable Diffusion UI by AUTOMATIC1111 https://github.com/AUTOMATIC1111/stable-diffusion-webui☆19Feb 10, 2023Updated 3 years ago
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Apr 24, 2024Updated last year
- Clip I/O extension for Stable Diffusion Web UI☆20Oct 21, 2023Updated 2 years ago
- ☆18Apr 18, 2025Updated 10 months ago
- ☆15Jun 20, 2024Updated last year
- dgenerate is a scriptable command line tool (and library) for generating images and animation sequences using stable diffusion and relate…☆42Oct 15, 2025Updated 4 months ago
- Custom LORA training on DynamiCrafter☆18Jul 26, 2024Updated last year
- ☆18Jan 5, 2025Updated last year
- ☆11Feb 25, 2026Updated last week
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39May 8, 2024Updated last year
- Custom script for AUTOMATIC1111's stable-diffusion-webui that adds more features to the standard xy grid☆15Jan 2, 2023Updated 3 years ago
- Executable State Dict Recipes☆81Updated this week