umm-emma/emma

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/umm-emma/emma)

umm-emma / emma

Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."

☆62

Alternatives and similar repositories for emma

Users that are interested in emma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JIA-Lab-research / DreamOmni3
View on GitHub
This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''
☆40Dec 30, 2025Updated 6 months ago
wren93 / tuna
View on GitHub
☆94Apr 29, 2026Updated 2 months ago
ByteVisionLab / NextFlow
View on GitHub
NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation
☆331Jan 9, 2026Updated 6 months ago
SanicsP / ComfyUI-CsvUtils
View on GitHub
A comfyui extension to manage prompts in a simple way using mainly csv files.
☆21Aug 8, 2025Updated 11 months ago
guozinan126 / MUSAR
View on GitHub
☆30May 7, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Hullabalo / ComfyUI-Loop
View on GitHub
Loop your image from output to input in your ComfyUI workflow
☆14Jan 16, 2026Updated 6 months ago
tinnerhrhe / GARDO
View on GitHub
Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"
☆61May 3, 2026Updated 2 months ago
AhBumm / ComfyUI_BillBum_APIset_Nodes
View on GitHub
A comfyui costume node by BillBum for using api gen (VLM LLM T2I API Tools)
☆11May 26, 2026Updated 2 months ago
nv-tlabs / ChronoEdit
View on GitHub
[ICLR 2026] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
☆698Nov 20, 2025Updated 8 months ago
zhengdian1 / AIA
View on GitHub
☆45Jan 4, 2026Updated 6 months ago
meituan-longcat / LongCat-Image
View on GitHub
☆710May 9, 2026Updated 2 months ago
wjl0313 / ComfyUI_KimNodes
View on GitHub
☆53Sep 22, 2025Updated 10 months ago
HKU-MMLab / Macro
View on GitHub
The official repo of "MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data"
☆67Mar 27, 2026Updated 3 months ago
SkyworkAI / UniPic
View on GitHub
Open-source SOTA multi-image editing model
☆871Jul 13, 2026Updated last week
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
SHI-Labs / T2I-Copilot
View on GitHub
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)
☆57Oct 6, 2025Updated 9 months ago
smthemex / ComfyUI_SHMT
View on GitHub
You can use SHMT method to apply makeup to the characters when use ComfyUI
☆29Jan 9, 2025Updated last year
AconexOfficial / ComfyUI_GOAT_Nodes
View on GitHub
Nodes to level up your workflows performance and streamline specific functions.
☆11Aug 19, 2025Updated 11 months ago
Tr1stesse / DirectEdit
View on GitHub
[ICML 2026] Official implementation for "DirectEdit: Step-Level Accurate Inversion for Flow-Based Image Editing".
☆28May 5, 2026Updated 2 months ago
Hungryyan1 / UniCorn
View on GitHub
☆79Apr 12, 2026Updated 3 months ago
fenfenfenfan / VMix
View on GitHub
Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control
☆191Dec 31, 2024Updated last year
facebookresearch / tuna-2
View on GitHub
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆739Updated this week
showlab / OmniPSD
View on GitHub
Official code implementation of "OmniPSD: Layered PSD Generation with Diffusion Transformer"
☆129May 25, 2026Updated 2 months ago
nudtfuruigang / traffic-light-detection
View on GitHub
☆10Feb 17, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
DEVAIEXP / mod-control-tile-upscaler-sdxl
View on GitHub
MoD Control Tile Upscaler for SDXL Pipeline
☆61Mar 8, 2025Updated last year
lian700 / SoliReward
View on GitHub
Official Code for "SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models" [CVPR2…
☆21Jul 13, 2026Updated last week
WainWong / ComfyUI-Loop-image
View on GitHub
A comfyui node that uses the loop function to process images and masks
☆44Mar 28, 2025Updated last year
Bria-AI / Fibo-Edit
View on GitHub
FIBO-Edit brings the power of structured prompt generation to image editing
☆44Jan 29, 2026Updated 5 months ago
knightyxp / VideoCoF
View on GitHub
[CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner
☆204Jun 17, 2026Updated last month
flying-sky999 / OmniV2V
View on GitHub
☆15Jun 2, 2025Updated last year
k1n0F / sageattention3-blackwell-wsl2
View on GitHub
☆15Nov 5, 2025Updated 8 months ago
NOVAglow646 / Monet
View on GitHub
[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"
☆213Mar 19, 2026Updated 4 months ago
Alpha-VLLM / Lumina-Accessory
View on GitHub
☆119Apr 25, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
matsuolab / multibanana
View on GitHub
[CVPR 2026 Main] MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation
☆29Jul 6, 2026Updated 2 weeks ago
W2GenAI-Lab / UltraFlux
View on GitHub
UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
☆145Apr 9, 2026Updated 3 months ago
franciszzj / Saber
View on GitHub
[CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation
☆76Apr 28, 2026Updated 2 months ago
byliutao / CDM
View on GitHub
Continuous-Time Distribution Matching for Few-Step Diffusion Distillation👏
☆147May 11, 2026Updated 2 months ago
little-misfit / GRAG-Image-Editing
View on GitHub
https://little-misfit.github.io/GRAG-Image-Editing/
☆119Nov 27, 2025Updated 7 months ago
ATH-MaaS / Ovis-Image
View on GitHub
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stri…
☆319May 15, 2026Updated 2 months ago
EnragedAntelope / ComfyUI-Doubutsu-Describer
View on GitHub
[DEPRECATED] Comfy node for using Doubutsu VLM for image descriptions
☆12Nov 9, 2025Updated 8 months ago