Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"
★295, Jan 29, 2026, updated 2 months ago
Alternatives and similar repositories for MMaDA-Parallel
Users who are interested in MMaDA-Parallel are comparing it with the libraries listed below.
- Paper List for In-context Learning (★19, Jan 3, 2023, updated 3 years ago)
- ★29, Oct 26, 2025, updated 5 months ago
- [CVPR 2026] Official Implementation of Dynamic erf (Derf). (★134, Mar 22, 2026, updated 3 weeks ago)
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models". (★64, Mar 5, 2026, updated last month)
- Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes (★27, Mar 12, 2026, updated last month)
- Model souping for LLMs (★73, Nov 18, 2025, updated 4 months ago)
- Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its… (★402, Jan 21, 2026, updated 2 months ago)
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models (★43, Apr 10, 2025, updated last year)
- [MICCAI'25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment (★20, Feb 27, 2026, updated last month)
- Memory Efficient Training Framework for Large Video Generation Model (★25, Apr 22, 2024, updated last year)
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi… (★23, Oct 1, 2025, updated 6 months ago)
- ★21, Nov 27, 2025, updated 4 months ago
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos. (★31, Jul 9, 2025, updated 9 months ago)
- Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer" (★80, Oct 29, 2025, updated 5 months ago)
- MLX Implementation of Recursive Reasoning with Tiny Networks (★78, Oct 11, 2025, updated 6 months ago)
- Official code for DeepSound-V1 (★13, May 14, 2025, updated 11 months ago)
- [CVPR 2026] SegEarth-R2: Towards Comprehensive Language-guided Segmentation for Remote Sensing Images (★48, Jan 24, 2026, updated 2 months ago)
- Reliable Wrist PPG Monitoring by Mitigating Poor Skin Sensor Contact (Scientific Reports) (★20, updated this week)
- Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes". (★94, Mar 31, 2026, updated 2 weeks ago)
- [NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities (★75, Dec 21, 2025, updated 3 months ago)
- Adaptive Multimodal Reasoning via Reinforcement Learning (★23, Jan 11, 2026, updated 3 months ago)
- An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models", https://arxiv.org/pdf/2307.08621.pdf (★11, Jul 25, 2023, updated 2 years ago)
- [ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement. (★15, Mar 12, 2024, updated 2 years ago)
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning" (★33, Jul 25, 2025, updated 8 months ago)
- Rethinking the Trust Region in LLM Reinforcement Learning (★51, Mar 2, 2026, updated last month)
- The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3" (★22, Dec 2, 2025, updated 4 months ago)
- ★18, May 10, 2023, updated 2 years ago
- MSP project: Latent Space Factorisation and Manipulation via Matrix Subspace Projection (ICML2020) (★14, Dec 4, 2021, updated 4 years ago)
- ★33, Nov 18, 2025, updated 4 months ago
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning… (★29, Sep 7, 2025, updated 7 months ago)
- [JAG 2022] Multitask consistency network with single temporal supervision for semi-supervised building change detection (★20, Aug 25, 2024, updated last year)
- Official PyTorch implementation for Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability [Neur… (★15, Jul 7, 2025, updated 9 months ago)
- A variational auto-encoder (VAE) framework with a new type of prior, the "Variational Mixture of Posteriors" prior, or VampPrior for short. (★10, Apr 7, 2021, updated 5 years ago)
- ★10, Oct 7, 2023, updated 2 years ago
- ★13, May 17, 2025, updated 10 months ago
- [CVPR 2026 Main] MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation (★21, Mar 26, 2026, updated 2 weeks ago)
- ★13, Jan 14, 2026, updated 3 months ago
- A tool for dumping Tauri assets (★22, Jan 6, 2026, updated 3 months ago)
- ★34, Aug 26, 2025, updated 7 months ago