Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
☆204Dec 17, 2025Updated 4 months ago
Alternatives and similar repositories for LaViDa
Users that are interested in LaViDa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆117Jul 9, 2025Updated 9 months ago
- ☆336Mar 23, 2026Updated 3 weeks ago
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)☆1,622Feb 14, 2026Updated 2 months ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)☆46Nov 24, 2025Updated 4 months ago
- ☆31Aug 18, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official Implementation of wd1☆28Sep 25, 2025Updated 6 months ago
- Dream 7B, a large diffusion language model☆1,219Nov 21, 2025Updated 4 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,728Nov 12, 2025Updated 5 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆930Updated this week
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆165Feb 16, 2026Updated 2 months ago
- [ICLR'26] Official code of paper "d2Cache: Accelerating Diffusion-based LLMs via Dual Adaptive Caching"☆111Mar 11, 2026Updated last month
- [ArXiv 2025] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models☆136Dec 25, 2025Updated 3 months ago
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…☆112Updated this week
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Jul 17, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Dec 1, 2024Updated last year
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆995Jul 10, 2025Updated 9 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 7 months ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆490Jan 28, 2026Updated 2 months ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,910Jan 8, 2026Updated 3 months ago
- A Collection of Papers on Diffusion Language Models☆167Sep 15, 2025Updated 7 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆129May 22, 2025Updated 10 months ago
- ☆11May 15, 2024Updated last year
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆54Apr 7, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆46Nov 6, 2025Updated 5 months ago
- Easy and Efficient dLLM Fine-Tuning☆244Mar 2, 2026Updated last month
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆519Nov 14, 2025Updated 5 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- ☆29Nov 9, 2025Updated 5 months ago
- Official implementation of BLIP3o-Series☆1,648Nov 29, 2025Updated 4 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆22Jan 11, 2026Updated 3 months ago
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆43Oct 28, 2025Updated 5 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".☆203Jun 18, 2025Updated 10 months ago
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆202Sep 18, 2025Updated 7 months ago
- Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model'☆45Jan 19, 2026Updated 3 months ago
- ☆51Aug 22, 2025Updated 7 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆164Jan 19, 2026Updated 3 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago