Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"
★298 · Jan 29, 2026 · Updated 3 months ago
Alternatives and similar repositories for MMaDA-Parallel
Users that are interested in MMaDA-Parallel are comparing it to the libraries listed below.
- Paper List for In-context Learning ★19 · Jan 3, 2023 · Updated 3 years ago
- [CVPR 2026] Official Implementation of Dynamic erf (Derf). ★146 · Mar 22, 2026 · Updated 2 months ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models". ★64 · Mar 5, 2026 · Updated 2 months ago
- Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes ★29 · Mar 12, 2026 · Updated 2 months ago
- Model souping for LLMs ★73 · Nov 18, 2025 · Updated 6 months ago
- Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its… ★407 · Jan 21, 2026 · Updated 4 months ago
- [MICCAI'25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment ★22 · Feb 27, 2026 · Updated 3 months ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision ★227 · Apr 14, 2026 · Updated last month
- Memory Efficient Training Framework for Large Video Generation Model ★25 · Apr 22, 2024 · Updated 2 years ago
- EraseAnything, ICML 2025 ★41 · Sep 28, 2025 · Updated 7 months ago
- ★21 · Nov 27, 2025 · Updated 5 months ago
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ★249 · Oct 12, 2025 · Updated 7 months ago
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos. ★33 · Apr 18, 2026 · Updated last month
- MLX Implementation of Recursive Reasoning with Tiny Networks ★78 · Oct 11, 2025 · Updated 7 months ago
- Official code for DeepSound-V1 ★12 · May 14, 2025 · Updated last year
- The code repository of UniRL ★52 · May 30, 2025 · Updated 11 months ago
- Simple and Ideal Circuit Simulation ★13 · Dec 4, 2017 · Updated 8 years ago
- Variational Autoencoder with non-euclidean (hyperbolic) latent space ★13 · Nov 25, 2022 · Updated 3 years ago
- [NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities ★76 · Dec 21, 2025 · Updated 5 months ago
- Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes". ★97 · Mar 31, 2026 · Updated last month
- Extended depth of field methods using CNN's ★16 · Apr 28, 2023 · Updated 3 years ago
- [CVPR 2026] ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks ★37 · Apr 9, 2026 · Updated last month
- [ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement. ★15 · Mar 12, 2024 · Updated 2 years ago
- an implementation of paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf ★11 · Jul 25, 2023 · Updated 2 years ago
- [CVPR 2026 Highlight] SegEarth-R2: Towards Comprehensive Language-guided Segmentation for Remote Sensing Images ★55 · Updated this week
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning" ★33 · Jul 25, 2025 · Updated 10 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning ★54 · Mar 2, 2026 · Updated 2 months ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis ★28 · Feb 14, 2026 · Updated 3 months ago
- ★18 · May 10, 2023 · Updated 3 years ago
- Code for our paper "Learning to Generate Unit Tests for Automated Debugging" ★18 · Mar 7, 2025 · Updated last year
- MSP project: Latent Space Factorisation and Manipulation via Matrix Subspace Projection (ICML2020) ★14 · Dec 4, 2021 · Updated 4 years ago
- ★34 · Nov 18, 2025 · Updated 6 months ago
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning… ★30 · Sep 7, 2025 · Updated 8 months ago
- Official PyTorch implementation for Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability [Neur… ★17 · Jul 7, 2025 · Updated 10 months ago
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj… ★39 · Feb 20, 2026 · Updated 3 months ago
- ★10 · Oct 7, 2023 · Updated 2 years ago
- ★13 · May 17, 2025 · Updated last year
- [CVPR 2026 Main] MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation ★24 · Updated this week
- ★13 · Jan 14, 2026 · Updated 4 months ago