A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.
☆24Jul 18, 2025Updated 7 months ago
Alternatives and similar repositories for flux-faster
Users that are interested in flux-faster are comparing it to the libraries listed below
Sorting:
- Forward-only Diffusion Probabilistic Models☆28May 23, 2025Updated 9 months ago
- This is the official repo for the paper "Accelerating Parallel Sampling of Diffusion Models" Tang et al. ICML 2024 https://openreview.net…☆16Jul 19, 2024Updated last year
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆27Dec 17, 2024Updated last year
- [ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache☆55Jan 26, 2026Updated last month
- ☆52May 19, 2025Updated 9 months ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- deploy yolov5 by Opencv and TensorRT in Python and CPP☆26Mar 23, 2022Updated 3 years ago
- Longitudinal Evaluation of LLMs via Data Compression☆33May 29, 2024Updated last year
- ☆34Feb 3, 2025Updated last year
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆79Aug 12, 2024Updated last year
- ☆79Dec 27, 2024Updated last year
- ☆32Feb 11, 2026Updated 3 weeks ago
- In our implementation of Qwen-Image-Edit, we employ block causal attention to improve inference speed.☆37Feb 16, 2026Updated 2 weeks ago
- Making Flux go brrr on GPUs.☆163Jan 5, 2026Updated last month
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- Embedded graphics library to create beautiful UIs for any MCU, MPU and display type.☆11Apr 29, 2024Updated last year
- Community maintained hardware plugin for vLLM on AWS Neuron☆23Updated this week
- ☆43Apr 2, 2024Updated last year
- DataHen useragent tool is a Golang package and standalone tool that generates a random combination of millions of user-agents strings. Cu…☆10Jun 3, 2021Updated 4 years ago
- use yolov3 onnx model to implement object detection☆11Apr 25, 2019Updated 6 years ago
- [NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed☆25Jul 26, 2025Updated 7 months ago
- 很好用的tnn classify demo☆11Mar 24, 2021Updated 4 years ago
- A simple script to add pdf-files to Zotero via CLI☆12May 17, 2020Updated 5 years ago
- Classification model using LSTM Bidirectional model of Amazon review data from kaggle data https://www.kaggle.com/bittlingmayer/amazonrev…☆11Jul 21, 2020Updated 5 years ago
- Analyzing Google Play Store reviews of the Indonesian streaming platform Vidio through topic modeling with the assistance of GPT.☆11Mar 14, 2024Updated last year
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- Hi, I'm Harmony the Hummingbird! Let's work together on whatever you care about.☆12May 3, 2024Updated last year
- conflict-free replicated storage mixing internal Go maps and Redis to achieve both speed and persistence.☆13May 26, 2016Updated 9 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing☆25Jan 27, 2026Updated last month
- Improving transparency of large language models' reasoning☆14Nov 25, 2025Updated 3 months ago
- ☆23Jul 11, 2025Updated 7 months ago
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models☆11Jan 23, 2024Updated 2 years ago
- Sea ORM adapter for casbin-rs☆11Aug 6, 2024Updated last year
- Hybrid Deep-learning and Iterative Reconstruction Scheme for Medical Imaging Reconstruction☆11Sep 26, 2023Updated 2 years ago
- Implemetation of "Pixel-In-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild"☆11Jul 6, 2023Updated 2 years ago
- "Causality: Models, Reasoning, and Inference-Judea Pearl(2009)"中文翻译及学习笔记☆15Feb 18, 2022Updated 4 years ago
- A homomorphic polynomial public key KEM and digital signature (DS) scheme.☆14Jan 26, 2026Updated last month
- ☆13Jan 7, 2025Updated last year