Vchitect / TACALinks
[ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
☆39Updated 5 months ago
Alternatives and similar repositories for TACA
Users that are interested in TACA are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Updated 8 months ago
- ☆47Updated 8 months ago
- Video Diffusion Transformers are In-Context Learners☆36Updated last year
- Transition Models☆140Updated 3 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆54Updated 4 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Updated last year
- ☆34Updated last week
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆71Updated 5 months ago
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…☆28Updated 4 months ago
- UniCon: A Simple Approach to Unifying Diffusion-based Conditional Generation (ICLR 2025)☆35Updated 6 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated last year
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆70Updated 2 months ago
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆156Updated 2 months ago
- 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation☆87Updated 2 weeks ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Updated last year
- Training-Free Text-Guided Image Editing Using Visual Autoregressive Model☆71Updated 8 months ago
- The official code of "Weak-to-Strong Diffusion with Reflection".☆55Updated 8 months ago
- ThinkGen: Generalized Thinking for Visual Generation☆17Updated last week
- ☆34Updated last year
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆84Updated 3 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆22Updated 9 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆64Updated last year
- Vision Bridge Transformer at Scale☆133Updated last month
- [ArXiv 2025] Follow-Your-Shape: This repo is the official implementation of "Follow-Your-Shape: Shape-Aware Image Editing via Trajectory…☆58Updated last month
- [ICCV 2025] The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆58Updated 9 months ago
- (ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆60Updated 3 months ago
- ☆63Updated 2 weeks ago
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆129Updated 6 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69Updated 7 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆43Updated 6 months ago