hustvl/DiffusionVL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hustvl/DiffusionVL)

hustvl / DiffusionVL

[ECCV 2026] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

☆155

Alternatives and similar repositories for DiffusionVL

Users that are interested in DiffusionVL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hustvl / TBCM
View on GitHub
Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs
☆21Dec 16, 2025Updated 7 months ago
hustvl / VGT
View on GitHub
Visual Generation Tuning
☆101Apr 16, 2026Updated 3 months ago
hustvl / MobileI2V
View on GitHub
[ArXiv 2025] MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices
☆87May 20, 2026Updated 2 months ago
hustvl / Spa3R
View on GitHub
Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning
☆51Mar 25, 2026Updated 3 months ago
hustvl / mmMamba
View on GitHub
The first decoder-only multimodal state space model
☆104May 19, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hustvl / InfiniteVL
View on GitHub
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models
☆110Jul 7, 2026Updated 2 weeks ago
hustvl / Snap-Snap
View on GitHub
The repository of "Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds"
☆40Sep 1, 2025Updated 10 months ago
ZrH42 / UniX
View on GitHub
☆31Mar 29, 2026Updated 3 months ago
hustvl / MaTVLM
View on GitHub
☆62May 13, 2025Updated last year
hustvl / MolSight
View on GitHub
[AAAI 2026] MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learni…
☆27Dec 5, 2025Updated 7 months ago
hustvl / Turbo-VAED
View on GitHub
[AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
☆131Jul 10, 2026Updated last week
hustvl / SuperCLIP
View on GitHub
☆140Dec 26, 2025Updated 6 months ago
hustvl / ViTGaze
View on GitHub
Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers"
☆62Mar 3, 2025Updated last year
ML-GSAI / LLaDA-V
View on GitHub
☆347Mar 23, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jacklishufan / LaViDa
View on GitHub
Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
☆227Dec 17, 2025Updated 7 months ago
MiniMax-AI / VTP
View on GitHub
[ECCV 2026] Towards Scalable Pre-training of Visual Tokenizers for Generation
☆495Apr 15, 2026Updated 3 months ago
hustvl / MoDA
View on GitHub
An hardware-aware Efficient Implementation for "Mixture-of-Depths Attention".
☆274May 6, 2026Updated 2 months ago
Tencent / WeDLM
View on GitHub
WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…
☆646Mar 3, 2026Updated 4 months ago
lgxi24 / AdaBlock-dLLM
View on GitHub
[ICLR 2026] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
☆15Jan 28, 2026Updated 5 months ago
hustvl / LENS
View on GitHub
[AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning
☆136Dec 3, 2025Updated 7 months ago
hustvl / GaussTR
View on GitHub
[CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
☆217Jan 5, 2026Updated 6 months ago
ErikZ719 / CoTA
View on GitHub
[ICLR 26] Context Tokens are Anchors: Understanding the Repeat Curse in dMLLMs from an Information Flow Perspective
☆16Mar 6, 2026Updated 4 months ago
JinjieNi / MegaDLMs
View on GitHub
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…
☆343Nov 11, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cambrian-mllm / cambrian-s
View on GitHub
Cambrian-S: Towards Spatial Supersensing in Video
☆561Apr 3, 2026Updated 3 months ago
hustvl / MaskAdapter
View on GitHub
[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"
☆135Oct 23, 2025Updated 8 months ago
hustvl / OpenInst
View on GitHub
☆17Nov 17, 2023Updated 2 years ago
hustvl / EVA-X
View on GitHub
[Nature Portfolio, npj DigitalMed] EVA-X: A foundation model for general chest X-ray analysis with self-supervised learning
☆100Jun 12, 2026Updated last month
hustvl / ViG
View on GitHub
[AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention
☆116Jun 17, 2024Updated 2 years ago
hustvl / TOGS
View on GitHub
[IEEE JBHI] The official code of "TOGS: Gaussian Splatting with Temporal Opacity Offset for Real-Time 4D DSA Rendering"
☆33Sep 10, 2025Updated 10 months ago
inclusionAI / dFactory
View on GitHub
Easy and Efficient dLLM Fine-Tuning
☆261Mar 2, 2026Updated 4 months ago
NVlabs / Fast-dLLM
View on GitHub
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆1,063May 30, 2026Updated last month
hustvl / LightningDiT
View on GitHub
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,508Dec 16, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Gen-Verse / dLLM-RL
View on GitHub
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
☆512Jan 28, 2026Updated 5 months ago
Cooperx521 / ScaleCap
View on GitHub
(ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’
☆60Jan 26, 2026Updated 5 months ago
Alpha-VLLM / Lumina-DiMOO
View on GitHub
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
☆1,003May 19, 2026Updated 2 months ago
ML-GSAI / ReFusion
View on GitHub
[ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"
☆63Dec 26, 2025Updated 6 months ago
gjhhust / YOLOFT
View on GitHub
A code base for the official XS-VID dataset baseline method YOLOFT
☆22Dec 24, 2024Updated last year
hustvl / DiG
View on GitHub
[CVPR 2025] DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
☆184Mar 1, 2025Updated last year
hustvl / OmniMamba
View on GitHub
[ECCV 2026] OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models
☆126Apr 25, 2025Updated last year