leejet / stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
☆3,264Updated 2 weeks ago
Related projects: ⓘ
- Tensor library for machine learning☆10,869Updated this week
- Lightweight inference library for ONNX files, written in C++. It can run SDXL on a RPI Zero 2 but also Mistral 7B on desktops and servers…☆1,823Updated last week
- High-speed Large Language Model Serving on PCs with Consumer-grade GPUs☆7,877Updated 2 weeks ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆3,493Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆4,727Updated last month
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,730Updated 11 months ago
- Fast stable diffusion on CPU☆1,401Updated this week
- ☆1,251Updated 10 months ago
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,403Updated last month
- Python bindings for llama.cpp☆7,723Updated this week
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.☆3,548Updated 6 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆7,641Updated 4 months ago
- Llama 2 Everywhere (L2E)☆1,510Updated last month
- This repository contains a pure C++ ONNX implementation of multiple offline AI models, such as StableDiffusion (1.5 and XL), ControlNet, …☆604Updated 5 months ago
- Run GGUF models easily with a KoboldAI UI. One File. Zero Install.☆4,836Updated this week
- Official Code for Stable Cascade☆6,511Updated last month
- An Open Source text-to-speech system built by inverting Whisper.☆3,779Updated 3 months ago
- tiny vision language model☆4,893Updated 3 weeks ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,089Updated last week
- ☆7,647Updated 5 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,198Updated last month
- Fast inference engine for Transformer models☆3,229Updated last week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,495Updated 2 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech☆684Updated 2 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆5,911Updated this week
- 4 bits quantization of LLaMA using GPTQ☆2,982Updated 2 months ago
- Generative models for conditional audio generation☆2,528Updated 2 months ago
- Foundational model for human-like, expressive TTS☆3,721Updated last month
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆7,459Updated 2 months ago
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend…☆1,958Updated 5 months ago