Vchitect / TACA
[ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
☆30 Updated last month
Alternatives and similar repositories for TACA
Users who are interested in TACA are comparing it to the repositories listed below.
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing" ☆27 Updated 4 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection ☆45 Updated 2 weeks ago
- (ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing ☆47 Updated 5 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions ☆37 Updated 2 months ago
- [ICCV 2025] FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing ☆51 Updated 3 weeks ago
- Video Diffusion Transformers are In-Context Learners ☆26 Updated 7 months ago
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ☆64 Updated last month
- ☆33 Updated 9 months ago
- Official implementation of “ACE: Anti-Editing Concept Erasure in Text-to-Image Models” ☆11 Updated 3 months ago
- The official code of "Weak-to-Strong Diffusion with Reflection". ☆48 Updated 3 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024) ☆37 Updated 11 months ago
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model ☆55 Updated 4 months ago
- ☆45 Updated 4 months ago
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025) ☆143 Updated this week
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆68 Updated last month
- DC-Gen: Accelerating Diffusion Models with Compressed Latent Space ☆53 Updated 2 weeks ago
- Implementation of paper EditCLIP: Representation Learning for Image Editing (ICCV 2025) ☆26 Updated 2 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers ☆55 Updated 10 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis" ☆67 Updated this week
- Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance ☆70 Updated 3 months ago
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation ☆30 Updated last year
- a collection of awesome autoregressive visual generation models ☆76 Updated 4 months ago
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing ☆64 Updated last month
- The code of Edit-Your-Motion ☆14 Updated last year
- [ECCV2024] Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models ☆44 Updated last year
- SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation (CVPR 2024) ☆66 Updated 3 weeks ago
- DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching (CVPR'25) ☆19 Updated 2 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025) ☆57 Updated 7 months ago
- ☆40 Updated 7 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation" ☆53 Updated 4 months ago