☆109Nov 27, 2024Updated last year
Alternatives and similar repositories for FluxKits
Users that are interested in FluxKits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆45Nov 25, 2025Updated 4 months ago
- [CVPR2026] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens☆53Mar 20, 2026Updated 3 weeks ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆214Sep 27, 2025Updated 6 months ago
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆163Dec 1, 2025Updated 4 months ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆50Mar 13, 2026Updated 3 weeks ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Scalable group inference for generating high quality and diverse images with diffusion models.☆42Aug 31, 2025Updated 7 months ago
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".☆118May 3, 2025Updated 11 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆122Mar 4, 2025Updated last year
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation☆24Oct 2, 2024Updated last year
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆66Oct 16, 2024Updated last year
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- This repo provides a working re-implementation of Latent Adversarial Diffusion Distillation by AMD☆124Jul 12, 2025Updated 9 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆109Sep 27, 2025Updated 6 months ago
- Chroma key (green screen removal) algorithms with Python☆11Jul 14, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Subjects200K dataset☆129Jan 17, 2025Updated last year
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆75Oct 21, 2025Updated 5 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆119Jul 15, 2024Updated last year
- Official repository for VQDM:Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization paper☆34Sep 17, 2024Updated last year
- ☆14Nov 24, 2023Updated 2 years ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆286Dec 4, 2024Updated last year
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer☆1,909Jul 3, 2025Updated 9 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆133May 16, 2025Updated 10 months ago
- [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆1,558Nov 10, 2025Updated 5 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Let's finetune video generation models!☆547Sep 15, 2025Updated 6 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆514Dec 11, 2024Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,286Oct 31, 2024Updated last year
- Vision Bridge Transformer at Scale☆141Dec 1, 2025Updated 4 months ago
- Crawler and cleaner of data for novelai embedding's training☆21May 22, 2025Updated 10 months ago
- ☆2,232Nov 8, 2024Updated last year
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆49Nov 27, 2024Updated last year
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆52Sep 14, 2024Updated last year
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆644Oct 16, 2025Updated 5 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [WIP] Better (FP8) attention for Hopper☆32Feb 24, 2025Updated last year
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆262Dec 27, 2024Updated last year
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated 2 years ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops☆30Mar 16, 2024Updated 2 years ago
- [CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO☆103Feb 28, 2026Updated last month
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation☆378May 21, 2025Updated 10 months ago