Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.
☆284Oct 12, 2024Updated last year
Alternatives and similar repositories for flux-fp8-api
Users that are interested in flux-fp8-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cog inference for flux models☆369Jul 31, 2025Updated 8 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆397Jan 8, 2026Updated 3 months ago
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆78Dec 3, 2024Updated last year
- A general fine-tuning kit geared toward image/video/audio diffusion models.☆2,804Updated this week
- A unified benchmarking framework for generative styling models in PyTorch☆14Oct 27, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆33Nov 4, 2024Updated last year
- https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching☆426Jul 5, 2025Updated 9 months ago
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models☆3,782Mar 7, 2026Updated last month
- OneDiff: An out-of-the-box acceleration library for diffusion models.☆1,972Dec 4, 2025Updated 4 months ago
- ☆80Dec 27, 2024Updated last year
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated 10 months ago
- Accelerates Flux.1 image generation, just by using this node.☆140Dec 19, 2024Updated last year
- https://hf.co/hexgrad/Kokoro-82M☆14Jan 14, 2026Updated 3 months ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆212Sep 27, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.☆1,306Mar 27, 2025Updated last year
- ☆109Nov 27, 2024Updated last year
- ☆49Feb 23, 2025Updated last year
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆214Sep 27, 2025Updated 6 months ago
- ☆2,232Nov 8, 2024Updated last year
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,914Oct 31, 2024Updated last year
- Rectified Flow Inversion (RF-Inversion) - ICLR 2025☆473Mar 19, 2025Updated last year
- Implicit Style-Content Separation using B-LoRA☆397Nov 14, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The ultimate training toolkit for finetuning diffusion models☆10,107Updated this week
- LayerDiffuse in pure diffusers without any GUI☆420Jun 16, 2024Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆52Jan 14, 2026Updated 3 months ago
- Official repository for VQDM:Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization paper☆34Sep 17, 2024Updated last year
- Concept Sliders for Precise Control of Diffusion Models☆1,132Jun 20, 2025Updated 9 months ago
- Text and image to video generation: Kandinsky 4.0 (2024)☆150Dec 17, 2024Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Jun 24, 2024Updated last year
- Model Compression Toolbox for Large Language Models and Diffusion Models☆774Aug 14, 2025Updated 8 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,253Feb 16, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆727Dec 2, 2024Updated last year
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"☆405Mar 19, 2025Updated last year
- Nodes for image juxtaposition for Flux in ComfyUI☆1,397Jan 9, 2025Updated last year
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- PyTorch code for our paper "Progressive Binarization with Semi-Structured Pruning for LLMs"☆13Mar 11, 2026Updated last month
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆41Mar 16, 2026Updated 3 weeks ago
- A pytorch quantization backend for optimum☆1,036Apr 2, 2026Updated last week