tang-bd / fuse-dit
[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
☆27Updated this week
Alternatives and similar repositories for fuse-dit
Users that are interested in fuse-dit are comparing it to the libraries listed below
Sorting:
- ☆19Updated last year
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆51Updated 2 weeks ago
- [ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆101Updated last year
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"☆42Updated 8 months ago
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)☆36Updated last year
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆58Updated 2 months ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆59Updated last month
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆75Updated 11 months ago
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 6 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"☆49Updated last month
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆34Updated 3 months ago
- RS-IMLE☆38Updated 5 months ago
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Updated last year
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆42Updated last year
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models☆57Updated 9 months ago
- ☆30Updated 2 months ago
- The official repo of continuous speculative decoding☆26Updated last month
- The official implementation of Distribution Backtracking Distillation for One-step Diffusion Models☆28Updated 3 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated 10 months ago
- [ICLR 2024] Code for FreeNoise based on LaVie☆35Updated last year
- 👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"☆26Updated 6 months ago
- A curated list of papers and resources for text-to-image evaluation.☆29Updated last year
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆41Updated 10 months ago
- Official Implementation of GrounDiT (NeurIPS 2024)☆53Updated 5 months ago
- ☆45Updated 2 months ago
- Code for paper "Principal Components" Enable A New Language of Images☆40Updated last month
- ☆78Updated last year
- ☆70Updated 6 months ago
- Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆32Updated 3 weeks ago