JinjieNi/OpenMoE2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JinjieNi/OpenMoE2)

JinjieNi / OpenMoE2

The official repo for "OpenMoE 2: Sparse Diffusion Language Models".

☆58

Alternatives and similar repositories for OpenMoE2

Users that are interested in OpenMoE2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JinjieNi / Quokka
View on GitHub
The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…
☆46Nov 6, 2025Updated 8 months ago
JinjieNi / MegaDLMs
View on GitHub
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…
☆343Nov 11, 2025Updated 8 months ago
JinjieNi / dlms-are-super-data-learners
View on GitHub
The official github repo for "Diffusion Language Models are Super Data Learners".
☆227Nov 6, 2025Updated 8 months ago
Labman42 / JetEngine
View on GitHub
A lightweight Inference Engine built for block diffusion models
☆47Apr 12, 2026Updated 3 months ago
OpenGVLab / SDLM
View on GitHub
Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…
☆98Dec 27, 2025Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
JinjieNi / MixEval-X
View on GitHub
The official github repo for MixEval-X, the first any-to-any, real-world benchmark.
☆17Feb 15, 2025Updated last year
inclusionAI / dInfer
View on GitHub
dInfer: An Efficient Inference Framework for Diffusion Language Models
☆475Feb 11, 2026Updated 5 months ago
zhrli324 / Corba
View on GitHub
☆18May 17, 2025Updated last year
inclusionAI / dFactory
View on GitHub
Easy and Efficient dLLM Fine-Tuning
☆261Mar 2, 2026Updated 4 months ago
Gen-Verse / dLLM-RL
View on GitHub
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
☆511Jan 28, 2026Updated 5 months ago
kuleshov-group / bd3lms
View on GitHub
[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆1,024Jul 10, 2025Updated last year
OpenMOSS / DiRL
View on GitHub
☆165Mar 30, 2026Updated 3 months ago
LuLuLuyi / TDAR
View on GitHub
Advancing Block Diffusion Language Models for Test-Time Scaling
☆16Feb 14, 2026Updated 5 months ago
chen-hao-chao / mdm-prime-v2
View on GitHub
MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Scaling of Diffusion Language Models
☆27May 23, 2026Updated 2 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
LLM360 / TxT360
View on GitHub
☆25Dec 18, 2024Updated last year
ali-vilab / matrix
View on GitHub
☆34Apr 8, 2025Updated last year
ByteDance-Seed / Stable-DiffCoder
View on GitHub
Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …
☆83Mar 9, 2026Updated 4 months ago
SqueezeAILab / CDLM
View on GitHub
CDLM: Consistency Diffusion Language Models for Faster Sampling
☆41Nov 25, 2025Updated 8 months ago
pengzhangzhi / Open-dLLM
View on GitHub
Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
☆645Updated this week
JetAstra / SDAR
View on GitHub
SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model（1.7B, 4B, 8B, 30B）
☆362Jun 2, 2026Updated last month
s-sahoo / Eso-LMs
View on GitHub
[ICML 2026] Esoteric Language Models
☆121Jul 13, 2026Updated last week
brianlck / FlexMDM
View on GitHub
☆55Sep 10, 2025Updated 10 months ago
DreamLM / Dream
View on GitHub
Dream 7B, a large diffusion language model
☆1,255Nov 21, 2025Updated 8 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
KlingAIResearch / DiffMoE
View on GitHub
[Arxiv 2025] Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT
☆175Oct 21, 2025Updated 9 months ago
xUhEngwAng / pinyin
View on GitHub
这个仓库包含了我在上人工智能课时完成的拼音输入法作业。
☆11Feb 16, 2022Updated 4 years ago
mcleish7 / retrofitting-recurrence
View on GitHub
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
☆68Nov 11, 2025Updated 8 months ago
ModalMinds / gym-v
View on GitHub
A unified framework for vision-language environments with Gymnasium-compatible interface
☆35Mar 17, 2026Updated 4 months ago
HazyResearch / ThunderMittens
View on GitHub
☆19Aug 26, 2025Updated 10 months ago
chili-lab / LT2
View on GitHub
Official Codebase: LT2: Linear-Time Looped Transformers.
☆49May 27, 2026Updated last month
Introspective-Diffusion / I-DLM
View on GitHub
☆151Apr 15, 2026Updated 3 months ago
tilde-research / nsa-release
View on GitHub
An efficient implementation of the NSA (Native Sparse Attention) kernel
☆133Jun 24, 2025Updated last year
cychomatica / FreeDave
View on GitHub
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
☆23May 19, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ML-GSAI / LLaDA
View on GitHub
Official PyTorch implementation for "Large Language Diffusion Models"
☆3,912Jul 15, 2026Updated last week
s-sahoo / scaling-dllms
View on GitHub
[ICML 2026] Scaling Beyond Masked Diffusion Language Models
☆31Jul 3, 2026Updated 3 weeks ago
ML-GSAI / ReFusion
View on GitHub
[ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"
☆63Dec 26, 2025Updated 7 months ago
czg1225 / DMax
View on GitHub
DMax: Aggressive Parallel Decoding for dLLMs
☆127Jul 5, 2026Updated 3 weeks ago
dllm-reasoning / d1
View on GitHub
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆453Jan 26, 2026Updated 6 months ago
adobe-research / LaVida-O
View on GitHub
☆23Dec 1, 2025Updated 7 months ago
Dingry / bunny_teleop_server
View on GitHub
☆21Dec 2, 2024Updated last year