Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"
☆294Jan 29, 2026Updated last month
Alternatives and similar repositories for MMaDA-Parallel
Users who are interested in MMaDA-Parallel are comparing it to the libraries listed below
- Official Implementation of Dynamic erf (Derf).☆133Dec 12, 2025Updated 3 months ago
- Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes☆27Mar 12, 2026Updated last week
- ☆11Dec 15, 2025Updated 3 months ago
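- First prize in full-video analysis at the 9th China Software Cup, and second prize in the A2 Baidu PaddlePaddle tracking track at the 10th China Software Cup☆10Jul 10, 2023Updated 2 years ago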
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆63Mar 5, 2026Updated 2 weeks ago
- Model souping for LLMs☆72Nov 18, 2025Updated 4 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 11 months ago
- EraseAnything, ICML 2025☆39Sep 28, 2025Updated 5 months ago
- Memory Efficient Training Framework for Large Video Generation Model☆25Apr 22, 2024Updated last year
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.☆31Jul 9, 2025Updated 8 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…☆246Oct 12, 2025Updated 5 months ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆47Jul 1, 2025Updated 8 months ago
- [MM '24] EvilEdit: Backdooring Text-to-Image Diffusion Models in One Second☆28Nov 19, 2024Updated last year
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX☆45Updated this week
- SR-DiT Speedrunning ImageNet Diffusion☆130Dec 31, 2025Updated 2 months ago
- [CVPR 2026] ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks☆32Mar 10, 2026Updated last week
- [NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities☆72Dec 21, 2025Updated 3 months ago
- Extended depth of field methods using CNNs☆15Apr 28, 2023Updated 2 years ago
- ☆23Jan 9, 2026Updated 2 months ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 2 months ago
- [ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.☆15Mar 12, 2024Updated 2 years ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated last month
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆21Dec 2, 2025Updated 3 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆45Mar 2, 2026Updated 2 weeks ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆106Feb 3, 2026Updated last month
- MSP project: Latent Space Factorisation and Manipulation via Matrix Subspace Projection (ICML2020)☆14Dec 4, 2021Updated 4 years ago
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning…☆29Sep 7, 2025Updated 6 months ago
- [JAG 2022] Multitask consistency network with single temporal supervision for semi-supervised building change detection☆20Aug 25, 2024Updated last year
- Official PyTorch implementation for Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability [Neur…☆15Jul 7, 2025Updated 8 months ago
- ☆13Jan 14, 2026Updated 2 months ago
- ☆10Oct 7, 2023Updated 2 years ago
- ☆13May 17, 2025Updated 10 months ago
- A simple PageRank implementation extended with sparsification, matrix-form computation, and parallelization☆12Oct 8, 2019Updated 6 years ago
- ☆34Aug 26, 2025Updated 6 months ago
- Official implementation of Categorical Flow Maps on text.☆47Feb 16, 2026Updated last month
- This repository contains the code for our ICML 2025 paper, "LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection" 🎉☆26May 29, 2025Updated 9 months ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆115Jul 9, 2025Updated 8 months ago
- [NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark☆292Nov 5, 2025Updated 4 months ago