M-E-AGI-Lab/Muddit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/M-E-AGI-Lab/Muddit)

M-E-AGI-Lab / Muddit

[ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.

☆119

Alternatives and similar repositories for Muddit

Users that are interested in Muddit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alexanderswerdlow / unidisc
View on GitHub
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…
☆142Apr 2, 2025Updated last year
viiika / HumanEdit
View on GitHub
[CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…
☆36May 8, 2025Updated last year
fudoki-hku / FUDOKI
View on GitHub
[NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
☆77Dec 21, 2025Updated 7 months ago
Shi-qingyu / RecTok
View on GitHub
[CVPR 26] Official PyTorch Implementation of RecTok
☆23Feb 24, 2026Updated 5 months ago
Gen-Verse / MMaDA
View on GitHub
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
☆1,660Feb 14, 2026Updated 5 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Ephemeral182 / Empirical-Study-of-GPT-4o-Image-Gen
View on GitHub
An Empirical Study of GPT-4o Image Generation Capabilities
☆29Apr 16, 2025Updated last year
Jiawei-Yang / DeTok
View on GitHub
Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"
☆195Feb 24, 2026Updated 5 months ago
Shi-qingyu / DreamRelation
View on GitHub
[CVPR 2025] DreamRelation: Bridging Customization and Relation Generation
☆19Dec 17, 2025Updated 7 months ago
furiosa-ai / uncage
View on GitHub
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
☆17Aug 12, 2025Updated 11 months ago
viiika / Prism
View on GitHub
[ICML 2026] Official Implementation of Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diff…
☆22Mar 4, 2026Updated 4 months ago
ByteVisionLab / TokenFlow
View on GitHub
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
☆464Aug 8, 2025Updated 11 months ago
collovlabs / ViewControl
View on GitHub
[IJCAI 2024] Official implementation of the paper "Integrating View Conditions for Image Synthesis"
☆25Aug 27, 2024Updated last year
M-E-AGI-Lab / Awesome-World-Models
View on GitHub
Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.
☆96Oct 26, 2025Updated 8 months ago
mercurystraw / Kris_Bench
View on GitHub
[NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"
☆46Oct 19, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
viiika / Meissonic
View on GitHub
[ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…
☆345Jul 15, 2026Updated last week
nnnth / UniLIP
View on GitHub
[ICLR 2026 🔥 ] Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"
☆151Jan 26, 2026Updated 5 months ago
marinero4972 / CyberV
View on GitHub
☆20Jun 10, 2025Updated last year
yu-rp / Dimple
View on GitHub
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆117Jul 9, 2025Updated last year
Owen718 / LongPrompt-LLamaGen
View on GitHub
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30Oct 21, 2024Updated last year
showlab / FQGAN
View on GitHub
FQGAN: Factorized Visual Tokenization and Generation
☆59Mar 29, 2025Updated last year
wusize / Harmon
View on GitHub
[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
☆191May 21, 2025Updated last year
causalfusion / causalfusion
View on GitHub
☆196Dec 17, 2024Updated last year
djghosh13 / geneval
View on GitHub
GenEval: An object-focused framework for evaluating text-to-image alignment
☆472Mar 3, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
viiika / Diffusion-Conductor
View on GitHub
[AAAI 2023 Summer Symposium, Best Paper Award] Taming Diffusion Models for Music-driven Conducting Motion Generation
☆26May 9, 2024Updated 2 years ago
turingmotors / One-D-Piece
View on GitHub
[ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
☆82Jul 30, 2025Updated 11 months ago
dllm-reasoning / d1
View on GitHub
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆453Jan 26, 2026Updated 5 months ago
hithqd / ReasonBrain
View on GitHub
【ICML2026】Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
☆27May 18, 2026Updated 2 months ago
DAMO-NLP-SG / DiGIT
View on GitHub
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
☆78Oct 31, 2024Updated last year
zijieli-Jlee / Dual-Diffusion
View on GitHub
Code for D-DiT
☆69Apr 1, 2025Updated last year
ATH-MaaS / Awesome-Unified-Multimodal-Models
View on GitHub
Awesome Unified Multimodal Models
☆1,305Mar 24, 2026Updated 4 months ago
FoundationVision / UniTok
View on GitHub
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529Nov 14, 2025Updated 8 months ago
chuwd19 / Split-Gibbs-Discrete-Diffusion-Posterior-Sampling
View on GitHub
☆15Aug 16, 2025Updated 11 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
pixeli99 / MixLN
View on GitHub
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆30Jul 24, 2025Updated last year
JiuhaiChen / BLIP3o
View on GitHub
Official implementation of BLIP3o-Series
☆1,664Nov 29, 2025Updated 7 months ago
wdrink / SimpleAR
View on GitHub
Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
☆431Jun 20, 2025Updated last year
Correr-Zhou / MagicTailor
View on GitHub
[IJCAI 2025 (Oral)] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion …
☆98Jan 18, 2026Updated 6 months ago
PKU-YuanGroup / ImgEdit
View on GitHub
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
☆328Nov 5, 2025Updated 8 months ago
Mikivishy / FullFront
View on GitHub
The official code repository for the FullFront benchmark
☆27May 16, 2025Updated last year
xushilin1 / dst-det
View on GitHub
[TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det
☆35Jun 3, 2025Updated last year