Official Implementation of LaViDa: A Large Diffusion Language Model for Multimodal Understanding
☆215 · Dec 17, 2025 · Updated 5 months ago
Alternatives and similar repositories for LaViDa
Users that are interested in LaViDa are comparing it to the libraries listed below.
- Dimple, the first Discrete Diffusion Multimodal Large Language Model ☆117 · Jul 9, 2025 · Updated 10 months ago
- ☆341 · Mar 23, 2026 · Updated 2 months ago
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL) ☆1,642 · Feb 14, 2026 · Updated 3 months ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025) ☆47 · Nov 24, 2025 · Updated 6 months ago
- ☆31 · Aug 18, 2025 · Updated 9 months ago
- Official Implementation of wd1 ☆30 · Sep 25, 2025 · Updated 8 months ago
- Dream 7B, a large diffusion language model ☆1,240 · Nov 21, 2025 · Updated 6 months ago
- Official PyTorch implementation for "Large Language Diffusion Models" ☆3,807 · Nov 12, 2025 · Updated 6 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding" ☆1,002 · Updated this week
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models" ☆167 · Feb 16, 2026 · Updated 3 months ago
- [ArXiv 2025] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models ☆144 · Dec 25, 2025 · Updated 5 months ago
- Easy and Efficient dLLM Fine-Tuning ☆256 · Mar 2, 2026 · Updated 2 months ago
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio… ☆114 · Apr 13, 2026 · Updated last month
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners" ☆13 · Dec 1, 2024 · Updated last year
- [Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey ☆379 · Apr 4, 2026 · Updated last month
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models ☆1,004 · Jul 10, 2025 · Updated 10 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ☆47 · Jul 17, 2025 · Updated 10 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning ☆36 · Aug 28, 2025 · Updated 9 months ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation. ☆1,933 · Jan 8, 2026 · Updated 4 months ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series. ☆506 · Jan 28, 2026 · Updated 4 months ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs ☆56 · Dec 7, 2025 · Updated 5 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models ☆131 · May 22, 2025 · Updated last year
- A Collection of Papers on Diffusion Language Models ☆171 · Sep 15, 2025 · Updated 8 months ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning ☆61 · Apr 7, 2026 · Updated last month
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca… ☆46 · Nov 6, 2025 · Updated 6 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding ☆524 · Nov 14, 2025 · Updated 6 months ago
- [ICCV 2025] Dynamic-VLM ☆28 · Dec 16, 2024 · Updated last year
- ☆29 · Nov 9, 2025 · Updated 6 months ago
- Official implementation of BLIP3o-Series ☆1,655 · Nov 29, 2025 · Updated 6 months ago
- [ICLR2025] γ-MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models ☆43 · Oct 28, 2025 · Updated 7 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows ☆20 · Nov 4, 2025 · Updated 6 months ago
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models". ☆205 · Jun 18, 2025 · Updated 11 months ago
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations ☆201 · Sep 18, 2025 · Updated 8 months ago
- Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model' ☆45 · May 10, 2026 · Updated 3 weeks ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆17 · Feb 9, 2026 · Updated 3 months ago
- Paper list, tutorial, and nano code snippet for Diffusion Large Language Models. ☆167 · Jan 19, 2026 · Updated 4 months ago
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3 ☆11 · Sep 14, 2025 · Updated 8 months ago
- ☆53 · Aug 22, 2025 · Updated 9 months ago