Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++
☆5,562Mar 15, 2026Updated this week
Alternatives and similar repositories for stable-diffusion.cpp
Users that are interested in stable-diffusion.cpp are comparing it to the libraries listed below
Sorting:
- Tensor library for machine learning☆14,220Feb 27, 2026Updated 2 weeks ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆857Nov 16, 2024Updated last year
- LLM inference in C/C++☆98,098Updated this week
- Port of OpenAI's Whisper model in C/C++☆47,474Mar 5, 2026Updated 2 weeks ago
- Inference Llama 2 in one file of pure C☆19,262Aug 6, 2024Updated last year
- CLIP inference in plain C/C++ with no extra dependencies☆552Jun 19, 2025Updated 9 months ago
- Fast stable diffusion on CPU and AI PC☆2,018Jan 10, 2026Updated 2 months ago
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,563Mar 23, 2025Updated 11 months ago
- ☆1,274Oct 24, 2023Updated 2 years ago
- Distribute and run LLMs with a single file.☆23,794Updated this week
- Universal LLM Deployment Engine with ML Compilation☆22,194Mar 9, 2026Updated last week
- This repository contains a pure C++ ONNX implementation of multiple offline AI models, such as StableDiffusion (1.5 and XL), ControlNet, …☆633May 29, 2025Updated 9 months ago
- Run GGUF models easily with a KoboldAI UI. One File. Zero Install.☆9,721Updated this week
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,749Updated this week
- Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation☆265Oct 31, 2023Updated 2 years ago
- Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but…☆2,032Jan 20, 2026Updated last month
- stable-diffusion.cpp bindings for python☆105Feb 7, 2026Updated last month
- Stable Diffusion in NCNN with c++, supported txt2img and img2img☆1,060Jul 3, 2023Updated 2 years ago
- https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.☆1,305Mar 27, 2025Updated 11 months ago
- Python bindings for llama.cpp☆10,058Aug 15, 2025Updated 7 months ago
- Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)☆569Aug 8, 2023Updated 2 years ago
- High-speed Large Language Model Serving for Local Deployment☆8,834Jan 24, 2026Updated last month
- Stable Diffusion GUI written in C++☆88Oct 3, 2025Updated 5 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,460Mar 4, 2026Updated 2 weeks ago
- Fast, flexible LLM inference☆6,681Feb 27, 2026Updated 2 weeks ago
- C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)☆2,961Jul 31, 2024Updated last year
- Minimalist ML framework for Rust☆19,669Updated this week
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆309Apr 11, 2024Updated last year
- LLM training in simple, raw C/CUDA☆29,143Jun 26, 2025Updated 8 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆1,809Updated this week
- ggml implementation of BERT☆496Feb 23, 2024Updated 2 years ago
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆106,179Updated this week
- The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.☆46,278Updated this week
- SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing☆6,984Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,709Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆73,479Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,911May 3, 2024Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆33,085Updated this week
- Port of Meta's Encodec in C/C++☆228Dec 4, 2024Updated last year