Vchitect/TACA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Vchitect/TACA)

Vchitect / TACA

[ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

☆42

Alternatives and similar repositories for TACA

Users that are interested in TACA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Vchitect / DCM
View on GitHub
[ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation
☆206Jun 8, 2025Updated last year
WeChatCV / NovaEdit
View on GitHub
[CVPR26] Nova: Video Editing via single/multiple frame references
☆49Mar 4, 2026Updated 4 months ago
yukangcao / FreeMorph
View on GitHub
[ICCV'25] FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
☆91Jul 24, 2025Updated 11 months ago
xbyym / StableWorld
View on GitHub
StableWorld: Towards Stable and Consistent Long Interactive Video Generation
☆97Mar 18, 2026Updated 4 months ago
Vchitect / FasterCache
View on GitHub
[ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
☆263Dec 27, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
furiosa-ai / uncage
View on GitHub
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
☆17Aug 12, 2025Updated 11 months ago
120L020904 / ACE
View on GitHub
Official implementation of “ACE: Anti-Editing Concept Erasure in Text-to-Image Models”
☆17Jan 5, 2026Updated 6 months ago
cszy98 / PLACE
View on GitHub
[CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis
☆44Mar 5, 2024Updated 2 years ago
liuxiaoyu1104 / SmartControl
View on GitHub
[ECCV 2024] Enhancing ControlNet for Handling Rough Visual Conditions
☆109Sep 4, 2024Updated last year
Stability-AI / marble
View on GitHub
☆40May 8, 2026Updated 2 months ago
Mrduckk / DCID
View on GitHub
Image DeMoiréing Using Dual Camera Fusion on Mobile Phones (ICME 2025)
☆19Jun 12, 2025Updated last year
smthemex / ComfyUI_DICE_Talk
View on GitHub
Use ‘DICE-Talk’ in ComfyUI，which is a method about 'Correlation-Aware Emotional Talking Portrait Generation'.
☆25May 7, 2025Updated last year
NadavSc / Diff-Mamba
View on GitHub
☆22Jan 23, 2026Updated 5 months ago
gy8888 / RelationAdapter
View on GitHub
Code Implementation of “RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers”
☆33Apr 13, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
YujiaHu1109 / IEAP
View on GitHub
[NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models
☆118Sep 27, 2025Updated 9 months ago
maoXyzt / SynBody
View on GitHub
☆27Oct 5, 2023Updated 2 years ago
fudan-generative-vision / MixFlow
View on GitHub
[CVPR 2026] MixFlow Training: Alleviating Exposure Bias with Slowed Interpolation Mixture
☆21Dec 23, 2025Updated 6 months ago
jacklishufan / Reflect-DiT
View on GitHub
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
☆56Aug 16, 2025Updated 11 months ago
judian17 / ComfyUI-UniWorld-jd17
View on GitHub
Unofficial ComfyUI implementation of UniWorld.
☆21Jun 10, 2025Updated last year
luowyang / Defusion
View on GitHub
Official repository for CVPR2025 paper "Visual-InstrOfficial repository for CVPR2025 paper "Visual-Instructed Degradation Diffusion for A…
☆15Mar 23, 2025Updated last year
GaParmar / group-inference
View on GitHub
Scalable group inference for generating high quality and diverse images with diffusion models.
☆43Aug 31, 2025Updated 10 months ago
Vchitect / LongVie
View on GitHub
☆334Jan 24, 2026Updated 5 months ago
star-kwon / FCDM
View on GitHub
[CVPR 2026] Official repository for "Reviving ConvNeXt for Efficient Convolutional Diffusion Models"
☆71Mar 26, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
cszhilu1998 / SelfHDR
View on GitHub
[ICLR 2024] Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes
☆79May 23, 2026Updated last month
Tr1stesse / DirectEdit
View on GitHub
[ICML 2026] Official implementation for "DirectEdit: Step-Level Accurate Inversion for Flow-Based Image Editing".
☆27May 5, 2026Updated 2 months ago
yinzhicun / RefSTAR
View on GitHub
RefSTAR: Blind Facial Image Restoration with Reference Selection, Transfer, and Reconstruction (AAAI 2026)
☆24Apr 13, 2026Updated 3 months ago
YBYBZhang / Tool-R1
View on GitHub
Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"
☆20Sep 16, 2025Updated 10 months ago
Monalissaa / FreeCus
View on GitHub
[ICCV`2025] FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers
☆16Jul 22, 2025Updated last year
CUC-MIPG / UnifyEdit
View on GitHub
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model
☆13Dec 29, 2024Updated last year
bytedance / SuperEdit
View on GitHub
[ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing
☆165Jun 26, 2025Updated last year
Vchitect / Cut2Next
View on GitHub
Cut2Next: Generating Next Shot via In-Context Tuning
☆33Aug 21, 2025Updated 11 months ago
wyhlovecpp / GPT-Image-Edit
View on GitHub
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
☆243Aug 15, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
shallowdream204 / DiCo
View on GitHub
[NeurIPS 2025 Spotlight] DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
☆72Feb 12, 2026Updated 5 months ago
arthur-qiu / FreeTraj
View on GitHub
Code for FreeTraj, a tuning-free method for trajectory-controllable video generation
☆114Sep 19, 2025Updated 10 months ago
jialuli-luka / Video-MSG
View on GitHub
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
☆28Apr 14, 2025Updated last year
YaoShunyu19 / MDIQA
View on GitHub
☆19Sep 4, 2025Updated 10 months ago
WikiChao / FreSca
View on GitHub
[CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model
☆55May 31, 2025Updated last year
liuxiaoyu1104 / AnimateAnywhere
View on GitHub
[TMM 2026] Rouse the Background in Human Image Animation
☆30Apr 24, 2025Updated last year
hyz317 / CHARM
View on GitHub
[SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling
☆49Apr 17, 2026Updated 3 months ago