Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"
☆299Jan 29, 2026Updated 4 months ago
Alternatives and similar repositories for MMaDA-Parallel
Users that are interested in MMaDA-Parallel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- [CVPR 2026] Official Implementation of Dynamic erf (Derf).☆148Mar 22, 2026Updated 2 months ago
- 第九届中国软件杯视频全量分析“一等奖”&第十届中国软件杯A2百度paddlepaddle跟踪赛道“二等奖”☆10Jul 10, 2023Updated 2 years ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆65Mar 5, 2026Updated 3 months ago
- Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its…☆407Jan 21, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆42Apr 10, 2025Updated last year
- [MICCAI‘25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment☆23Feb 27, 2026Updated 3 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆228May 31, 2026Updated 2 weeks ago
- Memory Efficient Training Framework for Large Video Generation Model☆25Apr 22, 2024Updated 2 years ago
- EraseAnything, ICML 2025☆41Sep 28, 2025Updated 8 months ago
- Official repository Flash Local Linear Attention☆36May 28, 2026Updated 2 weeks ago
- Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"☆83Oct 29, 2025Updated 7 months ago
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.☆33Apr 18, 2026Updated last month
- Official code for DeepSound-V1☆12May 14, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The code repository of UniRL☆52May 30, 2025Updated last year
- Simple and Ideal Circuit Simulation☆13Dec 4, 2017Updated 8 years ago
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆47Jul 1, 2025Updated 11 months ago
- [MM '24] EvilEdit: Backdooring Text-to-Image Diffusion Models in One Second☆28Nov 19, 2024Updated last year
- SR-DiT Speedrunning ImageNet Diffusion☆138Apr 6, 2026Updated 2 months ago
- Local AI filmmaking studio — skills, canvas, timeline — driven from your coding agent.☆292Jun 9, 2026Updated last week
- Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".☆98Mar 31, 2026Updated 2 months ago
- Extended depth of field methods using CNN's☆16Apr 28, 2023Updated 3 years ago
- ☆22Jan 9, 2026Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2026] ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks☆40Apr 9, 2026Updated 2 months ago
- [ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.☆15Mar 12, 2024Updated 2 years ago
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- [CVPR 2026 Highlight] SegEarth-R2: Towards Comprehensive Language-guided Segmentation for Remote Sensing Images☆60May 28, 2026Updated 2 weeks ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 10 months ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆25Dec 2, 2025Updated 6 months ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated 4 months ago
- ☆18May 10, 2023Updated 3 years ago
- ☆51Updated this week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- MSP project: Latent Space Factorisation and Manipulation via Matrix Subspace Projection (ICML2020)☆14Dec 4, 2021Updated 4 years ago
- ☆34Nov 18, 2025Updated 6 months ago
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning…☆30Sep 7, 2025Updated 9 months ago
- [JAG 2022] Multitask consistency network with single temporal supervision for semi-supervised building change detection☆21Aug 25, 2024Updated last year
- A variational auto-encoder (VAE) framework with a new type of prior "Variational Mixture of Posteriors" prior, or VampPrior for short.☆10Apr 7, 2021Updated 5 years ago
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj…☆39Feb 20, 2026Updated 3 months ago
- ☆13May 17, 2025Updated last year