furiosa-ai / uncageLinks
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
☆18Updated 5 months ago
Alternatives and similar repositories for uncage
Users that are interested in uncage are comparing it to the libraries listed below
Sorting:
- official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".☆44Updated 3 weeks ago
- Official implementation of Progressive Detail Injection for Training-Free Semantic Binding in Text-to-Image Generation☆31Updated 5 months ago
- ☆11Updated last month
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Updated 5 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69Updated 8 months ago
- [Arxiv 2025] SparseD: Sparse Attention for Diffusion Language Models☆55Updated 3 months ago
- ☆31Updated 4 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025)☆38Updated 6 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Updated last year
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆31Updated 3 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆99Updated 3 months ago
- ☆132Updated 6 months ago
- This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting"☆125Updated 6 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆46Updated 3 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆80Updated 8 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆120Updated 10 months ago
- CoDi:Subject-Consistent and Pose-Diverse Text-to-Image Generation☆36Updated 5 months ago
- [CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model☆55Updated 7 months ago
- Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.☆97Updated 2 weeks ago
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆51Updated 2 weeks ago
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆32Updated 3 months ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆109Updated 3 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆122Updated 5 months ago
- An official implementation of SwapAnyone.☆73Updated 10 months ago
- Test-time Scaling for VAR models☆29Updated 4 months ago
- ☆33Updated 2 months ago
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆32Updated last month
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆34Updated 3 weeks ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆64Updated last week
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster".☆29Updated 10 months ago