fudoki-hku/FUDOKI

[NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities

☆77

Alternatives and similar repositories for FUDOKI

Users that are interested in FUDOKI are comparing it to the libraries listed below.

M-E-AGI-Lab / Muddit
[ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…
☆119 · Apr 13, 2026 · Updated 3 months ago
viiika / Prism
[ICML 2026] Official Implementation of Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diff…
☆22 · Mar 4, 2026 · Updated 4 months ago
OpenGVLab / SDLM
Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…
☆98 · Dec 27, 2025 · Updated 7 months ago
Labman42 / JetEngine
A lightweight inference engine built for block diffusion models
☆47 · Apr 12, 2026 · Updated 3 months ago
kuleshov-group / e2d2
[NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
☆47 · Oct 29, 2025 · Updated 9 months ago
SJTU-DENG-Lab / Discrete-Diffusion-Forcing
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
☆261 · Feb 3, 2026 · Updated 5 months ago
baaivision / URSA
[ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation
☆123 · May 20, 2026 · Updated 2 months ago
inclusionAI / Ming-UniVision
Code release for Ming-UniVision: Joint Image Understanding and Generation with a Continuous Unified Tokenizer
☆143 · Oct 14, 2025 · Updated 9 months ago
YuqingWang1029 / TokenBridge
[ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…
☆158 · Jul 24, 2025 · Updated last year
Gen-Verse / MMaDA
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
☆1,660 · Feb 14, 2026 · Updated 5 months ago
ML-GSAI / LLaDA-V
☆348 · Mar 23, 2026 · Updated 4 months ago
YuqingWang1029 / CubiD
[CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs…
☆63 · Apr 10, 2026 · Updated 3 months ago
dongzhuoyao / flowseq
An official PyTorch implementation of the EACL 2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps"
☆34 · Jul 17, 2025 · Updated last year
NVlabs / Fast-dLLM
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆1,064 · May 30, 2026 · Updated last month
OliverRensu / xAR
This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat…
☆251 · Oct 12, 2025 · Updated 9 months ago
LiQiiiii / DLLM-Survey
[arXiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey
☆387 · Apr 4, 2026 · Updated 3 months ago
JetAstra / SDAR
SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model (1.7B, 4B, 8B, 30B)
☆364 · Jun 2, 2026 · Updated last month
alexanderswerdlow / unidisc
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…
☆142 · Apr 2, 2025 · Updated last year
nnnth / UniLIP
[ICLR 2026 🔥] Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"
☆151 · Jan 26, 2026 · Updated 6 months ago
OpenGVLab / MMIU
[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
☆98 · Sep 14, 2024 · Updated last year
CompVis / maskflow
MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation
☆28 · Mar 4, 2025 · Updated last year
FoundationVision / UniTok
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529 · Nov 14, 2025 · Updated 8 months ago
MCG-NJU / PixNerd
[ICLR 2026] PixNerd: Pixel Neural Field Diffusion
☆185 · Dec 10, 2025 · Updated 7 months ago
hithqd / ReasonBrain
[ICML 2026] Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
☆27 · May 18, 2026 · Updated 2 months ago
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,960 · Aug 15, 2024 · Updated last year
stepfun-ai / NextStep-1
[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …
☆692 · Feb 27, 2026 · Updated 5 months ago
ali-vilab / iv-vae
☆34 · Mar 4, 2025 · Updated last year
phymhan / S2D2
☆16 · Jun 17, 2026 · Updated last month
ByteVisionLab / TokenFlow
[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
☆464 · Aug 8, 2025 · Updated 11 months ago
ali-vilab / alitok
[ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model
☆56 · Oct 12, 2025 · Updated 9 months ago
turingmotors / One-D-Piece
[ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression
☆84 · Jul 30, 2025 · Updated 11 months ago
ali-vilab / ViewPoint
[NeurIPS 2025] ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models
☆34 · Jul 1, 2025 · Updated last year
JiuhaiChen / BLIP3o
Official implementation of BLIP3o-Series
☆1,663 · Nov 29, 2025 · Updated 8 months ago
Gen-Verse / dLLM-RL
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
☆511 · Jan 28, 2026 · Updated 6 months ago
kuleshov-group / d2
d2: Improved Techniques for Training Reasoning Diffusion Language Models
☆16 · Mar 25, 2026 · Updated 4 months ago
google-deepmind / md4
Official JAX Implementation of MD4 Masked Diffusion Models
☆162 · Feb 27, 2025 · Updated last year
csuhan / Tar
[NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
☆202 · Sep 18, 2025 · Updated 10 months ago
Yovecent / UDM-GRPO
[ICML 2026 Spotlight] UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models
☆27 · May 1, 2026 · Updated 2 months ago
Ego4DSounds / Ego4DSounds
Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence
☆21 · Jun 14, 2024 · Updated 2 years ago