dougeeai/llama-cpp-python-wheels

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dougeeai/llama-cpp-python-wheels)

dougeeai / llama-cpp-python-wheels

Pre-built wheels for llama-cpp-python across platforms and CUDA versions

☆80

Alternatives and similar repositories for llama-cpp-python-wheels

Users that are interested in llama-cpp-python-wheels are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mengqin / SageAttention
View on GitHub
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-t…
☆69Jan 19, 2026Updated 6 months ago
JamePeng / llama-cpp-python
View on GitHub
Python bindings for llama.cpp
☆481Updated this week
paolaoshi / ComfyUI-llama_Dapao
View on GitHub
本地调用各种llama模型的comfyui节点，包含gemma4和Qwen3.5等常用模型
☆37Jul 20, 2026Updated last week
sdbds / SageAttention-for-windows
View on GitHub
Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossi…
☆151Jul 9, 2026Updated 2 weeks ago
seyf1elislam / OneClick_LLM_API_onColab
View on GitHub
Run gguf LLM models in Latest Version TextGen-webui and koboldcpp
☆20Aug 6, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
woct0rdho / SpargeAttn
View on GitHub
Fork of SpargeAttention (SparseSageAttention) for Windows wheels and easy installation
☆37May 14, 2026Updated 2 months ago
wildminder / AI-windows-whl
View on GitHub
Pre-compiled Python whl for Flash-attention, SageAttention, NATTEN, xFormer etc
☆729Updated this week
lrzjason / Comfyui-LatentUtils
View on GitHub
a set of utils for comfyui latent operation
☆101Dec 6, 2025Updated 7 months ago
workordie / ComfyUI-Qwen3.5
View on GitHub
ComfyUI custom node for Qwen3.5-9B unified multimodal model
☆39Mar 13, 2026Updated 4 months ago
joyfoxai / LTX2-ICEdit-Insight
View on GitHub
基于LTX2.3的向高清修复、视频去水印等编辑任务，提出统一的时空扩散技术路线：在 LTX-2.3 框架下以任务感知型适配器（Task-Aware Adapters）替代单一 LoRA 增量，通过轻量 Control-Transformer、零初始化残差连接与分层解耦注入机…
☆135Jun 1, 2026Updated last month
eddyhhlure1Eddy / ComfyUI-UniversalBlockSwap
View on GitHub
ComfyUI-UniversalBlockSwap
☆49Sep 18, 2025Updated 10 months ago
adambarbato / ComfyUI-Sa2VA
View on GitHub
A ComfyUI node implementation for ByteDance's Sa2VA
☆96Dec 22, 2025Updated 7 months ago
fantaskiss / ComfyUI-Qwen3_VQA_enhanced
View on GitHub
对原始的qwen3_VQA节点的增强。其他功能保持不变，只增加了自动扫描prompt_generator文件夹功能。不新建节点组，只对原节点组进行增加与修改。
☆22Mar 2, 2026Updated 4 months ago
KLL535 / ComfyUI_Simple_Qwen3-VL-gguf
View on GitHub
Simple gguf LLM Qwen3-VL, Qwen3.5, Qwen3.6, Gemma4 and others model loader for Comfy-UI.
☆82Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
billwuhao / ComfyUI_ASR
View on GitHub
带时间戳、标点符号，自动语音识别。给视频自动添加字幕。
☆35Feb 9, 2026Updated 5 months ago
judian17 / ComfyUI-OpenPose-Editor-jd
View on GitHub
☆57Nov 2, 2025Updated 8 months ago
Qo-qiao / ComfyUI-omni-llm
View on GitHub
☆38Updated this week
spacepxl / ComfyUI-VAE-Utils
View on GitHub
☆219May 17, 2026Updated 2 months ago
woct0rdho / ComfyUI-RadialAttn
View on GitHub
RadialAttention in ComfyUI native workflow
☆120Dec 19, 2025Updated 7 months ago
HM-RunningHub / ComfyUI_RH_MOVA
View on GitHub
This is a ComfyUI plugin for https://github.com/OpenMOSS/MOVA
☆22Jan 30, 2026Updated 5 months ago
siraxe / ComfyUI-LTX-FDG
View on GitHub
☆25Mar 10, 2026Updated 4 months ago
shumoLR / Comfyui_SynVow_Qwen3ASR
View on GitHub
A ComfyUI speech recognition plugin based on [Qwen3-ASR](https://github.com/QwenLM/Qwen3-ASR).
☆35Feb 6, 2026Updated 5 months ago
princepainter / ComfyUI-PainterLTXV2
View on GitHub
ComfyUI custom nodes for LTXV audio-video separation sampling and latent preparation. PainterSamplerLTXV: Advanced sampler with external…
☆106Jan 20, 2026Updated 6 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
woct0rdho / SageAttention
View on GitHub
Fork of SageAttention for Windows wheels and easy installation
☆889Jul 17, 2026Updated last week
kijai / ComfyUI_essentials
View on GitHub
☆14Jan 22, 2025Updated last year
xmarre / ComfyUI-Image-Conveyor
View on GitHub
A Vue-node ComfyUI image queue that lets you drag in unlimited images, organize them visually, and process them sequentially across queue…
☆25May 16, 2026Updated 2 months ago
DragonDiffusionbyBoyo / Boyonodes
View on GitHub
A set of Comfyui nodes
☆16Jun 21, 2026Updated last month
Zar4X / ComfyUI-Batch-Process
View on GitHub
A ComfyUI node that could help you batch process files
☆15Feb 12, 2026Updated 5 months ago
Windecay / ComfyUI-ReservedVRAM
View on GitHub
A simple node that can dynamically adjust the reserved memory of a workflow in real-time, used to avoid the utilization of shared memory.
☆387Jul 4, 2026Updated 3 weeks ago
phazei / ComfyUI-Enhancement-Utils
View on GitHub
☆27Jun 7, 2026Updated last month
princepainter / Comfyui-PainterFluxImageEdit
View on GitHub
All-in-one Flux2 text-to-image & image editing node that combines CLIP encoding, VAE encoding, and reference latent injection.
☆129Feb 7, 2026Updated 5 months ago
smthemex / ComfyUI_JoyAI_Echo
View on GitHub
Pushing the Frontier of Long Video Generation Standalone, inference-only release for minute-level multi-shot audio-video generation with…
☆58Jun 23, 2026Updated last month
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
invisietch / Chatterbox
View on GitHub
Multi-turn dataset management tool for LLM trainers
☆13Mar 31, 2025Updated last year
wallen0322 / ComfyUI-SageAttention3
View on GitHub
An experimental node
☆26Jan 13, 2026Updated 6 months ago
naxci1 / ComfyUI-FlashVSR_Stable
View on GitHub
High-performance Video Super Resolution for ComfyUI with VRAM optimization.
☆60Feb 13, 2026Updated 5 months ago
1038lab / ComfyUI-FlashVSR
View on GitHub
Powerful ComfyUI custom node built on the FlashVSR V1.1 model, facilitating real-time diffusion-based video super-resolution for streamin…
☆115Nov 17, 2025Updated 8 months ago
kijai / ComfyUI-NativeLooping_testing
View on GitHub
Temporary repository for development
☆25Jun 15, 2026Updated last month
woct0rdho / triton-windows
View on GitHub
Fork of the Triton language and compiler for Windows support and easy installation
☆1,957Feb 18, 2026Updated 5 months ago
fblissjr / ComfyUI-QwenImageWanBridge
View on GitHub
qwen-image and wan2.2 wan2.1 bridge
☆190Apr 18, 2026Updated 3 months ago