sayakpaul / tt-scale-flux
Inference-time scaling of diffusion-based image and video generation models.
☆168Updated 3 months ago
Alternatives and similar repositories for tt-scale-flux
Users who are interested in tt-scale-flux are comparing it to the libraries listed below.
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆119Updated 7 months ago
- DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space☆183Updated this week
- Official Implementation of weights2weights☆148Updated 6 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆124Updated 6 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving☆248Updated 2 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework.☆94Updated 3 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆228Updated last month
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆66Updated 4 months ago
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization☆157Updated 3 weeks ago
- Distilling Diversity and Control in Diffusion Models☆45Updated 5 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆210Updated last week
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆193Updated 3 months ago
- Make self forcing endless. Add cache purging. Add prompt controllability.☆59Updated 3 weeks ago
- [NeurIPS'2024] Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps☆100Updated last year
- The official implementation of "RepVideo: Rethinking Cross-Layer Representation for Video Generation"☆119Updated 8 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆302Updated 6 months ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆137Updated last month
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆100Updated last year
- [NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance☆129Updated 11 months ago
- Scale-wise Distillation of Diffusion Models☆110Updated 2 weeks ago
- ☆170Updated 2 weeks ago
- ☆69Updated 11 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆160Updated last year
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"☆156Updated 2 months ago
- (CVPR 2025) Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis☆194Updated 2 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆66Updated 6 months ago
- [ICCV2025] The code of our work "Golden Noise for Diffusion Models: A Learning Framework".☆186Updated last month
- ☆90Updated last year
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆184Updated last year
- This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting"☆121Updated 2 months ago