aredden / flux-fp8-apiLinks

Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.
264Updated 7 months ago

Alternatives and similar repositories for flux-fp8-api

Users that are interested in flux-fp8-api are comparing it to the libraries listed below

Sorting: