sayakpaul / tt-scale-fluxLinks
Inference-time scaling of diffusion-based image and video generation models.
☆172Updated last month
Alternatives and similar repositories for tt-scale-flux
Users that are interested in tt-scale-flux are comparing it to the libraries listed below
Sorting:
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆134Updated 9 months ago
- Official Implementation of weights2weights☆154Updated 10 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework.☆97Updated 6 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆120Updated 10 months ago
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space☆331Updated 3 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆78Updated 10 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆267Updated 5 months ago
- [NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance☆131Updated last year
- [NeurIPS'2024] Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps☆101Updated last year
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆213Updated 3 months ago
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization☆162Updated 4 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69Updated 8 months ago
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First☆114Updated 7 months ago
- ☆172Updated 4 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆304Updated 10 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆210Updated 2 months ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆191Updated last year
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆200Updated 6 months ago
- Distilling Diversity and Control in Diffusion Models☆50Updated 8 months ago
- This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting"☆125Updated 6 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆410Updated 4 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆103Updated last year
- ☆91Updated last year
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆243Updated 5 months ago
- ☆123Updated last year
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"☆165Updated 6 months ago
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆203Updated 11 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆449Updated 2 months ago
- [ICCV2025] The code of our work "Golden Noise for Diffusion Models: A Learning Framework".☆195Updated 5 months ago
- Official PyTorch implementation of TokenSet.☆127Updated 9 months ago