hutaiHang/ToMe

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hutaiHang/ToMe)

hutaiHang / ToMe

[NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis

☆86

Alternatives and similar repositories for ToMe

Users that are interested in ToMe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

clf28 / Detail-plus-plus
View on GitHub
[IEEE TIP] Official implementation of Progressive Detail Injection for Training-Free Semantic Binding in Text-to-Image Generation
☆33Aug 3, 2025Updated 11 months ago
LeapLabTHU / ENAT
View on GitHub
[NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
☆25Nov 28, 2024Updated last year
I2-Multimedia-Lab / Magnet
View on GitHub
Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Func…
☆31Dec 2, 2024Updated last year
agwmon / MuDI
View on GitHub
[NeurIPS 2024] MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models
☆96Jan 17, 2025Updated last year
RoyiRa / Linguistic-Binding-in-Diffusion-Models
View on GitHub
☆82Nov 25, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
aim-uofa / FreeCustom
View on GitHub
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
☆177Sep 1, 2025Updated 10 months ago
ip-composer / IP-Composer
View on GitHub
☆20Apr 15, 2025Updated last year
sen-mao / Loopfree
View on GitHub
[CVPR2025] Official Implementations "One-Way Ticket : Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models"
☆29Mar 16, 2026Updated 4 months ago
sen-mao / SuppressEOT
View on GitHub
Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)
☆60Dec 3, 2024Updated last year
ylingfeng / Add-SD
View on GitHub
Official implementation of Add-SD: Rational Generation without Manual Reference.
☆28Aug 19, 2024Updated last year
Litalby1 / make-it-count
View on GitHub
Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)
☆96Mar 12, 2025Updated last year
DCDmllm / AnyEdit
View on GitHub
【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
☆226Apr 5, 2025Updated last year
luping-liu / Detector-Guidance
View on GitHub
The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG)
☆20Feb 7, 2024Updated 2 years ago
hutaiHang / Faster-Diffusion
View on GitHub
[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"
☆352Mar 16, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Karine-Huang / T2I-CompBench
View on GitHub
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
☆345May 7, 2026Updated 2 months ago
hzphzp / WeGen
View on GitHub
☆27Apr 25, 2025Updated last year
byliutao / 1Prompt1Story
View on GitHub
🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
☆318Oct 20, 2025Updated 9 months ago
pOpsPaper / pOps
View on GitHub
Official implementation for "pOps: Photo-Inspired Diffusion Operators"
☆86Jul 23, 2024Updated 2 years ago
fallenshock / FlowEdit
View on GitHub
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"
☆1,010May 27, 2026Updated last month
yuval-alaluf / Attend-and-Excite
View on GitHub
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
☆771Jan 26, 2024Updated 2 years ago
Nihukat / Concept-Conductor
View on GitHub
☆17Feb 21, 2025Updated last year
pratheba / FORA
View on GitHub
FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.
☆56Jul 8, 2024Updated 2 years ago
JIA-Lab-research / MagicMirror
View on GitHub
[ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers
☆130Jun 26, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Xilluill / KV-Edit
View on GitHub
[ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation
☆387May 21, 2025Updated last year
fudan-zvg / FreqPrior
View on GitHub
[ICLR 2025] FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise
☆14Mar 5, 2025Updated last year
liujianzhi / EchoReel
View on GitHub
An innovative method designed to augment the capabilities of existing video diffusion models
☆22May 10, 2024Updated 2 years ago
kodenii / ORES
View on GitHub
ORES: Open-vocabulary Responsible Visual Synthesis
☆14Dec 12, 2023Updated 2 years ago
omer11a / bounded-attention
View on GitHub
☆96Sep 22, 2024Updated last year
WUyinwei-hah / RRNet
View on GitHub
[CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model
☆48Sep 13, 2024Updated last year
hqhQAQ / MIP-Adapter
View on GitHub
[AAAI 2025] Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation
☆174Jul 1, 2025Updated last year
aniki-ly / FreeLong
View on GitHub
[NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…
☆67Jul 2, 2025Updated last year
EnergyAttention / Energy-Based-CrossAttention
View on GitHub
The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".
☆51Apr 1, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
NVlabs / QLIP
View on GitHub
[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation
☆97Mar 1, 2025Updated last year
garibida / cross-image-attention
View on GitHub
Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"
☆404May 5, 2024Updated 2 years ago
HaozheLiu-ST / T-GATE
View on GitHub
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
☆418Feb 26, 2025Updated last year
Nihukat / FreeGraftor
View on GitHub
☆22Jan 19, 2026Updated 6 months ago
ali-vilab / ChatDiT
View on GitHub
☆53Dec 20, 2024Updated last year
mit-han-lab / fastcomposer
View on GitHub
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
☆715Jan 10, 2025Updated last year
luping-liu / LongAlign
View on GitHub
The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)
☆83Apr 23, 2025Updated last year