M-E-AGI-Lab / MudditLinks
Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.
☆59Updated 3 weeks ago
Alternatives and similar repositories for Muddit
Users that are interested in Muddit are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆109Updated last month
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆107Updated 2 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆56Updated last month
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆63Updated 4 months ago
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆98Updated this week
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆62Updated last month
- ☆152Updated last week
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆37Updated 3 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆113Updated 3 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆76Updated 6 months ago
- ☆37Updated last month
- Code for paper "Principal Components" Enable A New Language of Images☆44Updated 2 weeks ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆30Updated 2 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆164Updated 3 months ago
- Distilling Diversity and Control in Diffusion Models☆41Updated last month
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆51Updated 2 months ago
- ☆66Updated last week
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆39Updated 2 months ago
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆33Updated 4 months ago
- ☆50Updated 6 months ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆67Updated 2 months ago
- Vico: Compositional Video Generation as Flow Equalization☆57Updated 7 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆69Updated 7 months ago
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆70Updated last week
- The code repository of UniRL☆30Updated 3 weeks ago
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆96Updated last week
- Quick Long Video Understanding☆55Updated last week
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster".☆23Updated 3 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆45Updated 4 months ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆49Updated 3 months ago