Auraithm/LLADA_pretraining

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Auraithm/LLADA_pretraining)

Auraithm / LLADA_pretraining

☆31

Alternatives and similar repositories for LLADA_pretraining

Users that are interested in LLADA_pretraining are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JinjieNi / Quokka
View on GitHub
The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…
☆46Nov 6, 2025Updated 8 months ago
JetAstra / SDAR
View on GitHub
SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model（1.7B, 4B, 8B, 30B）
☆362Jun 2, 2026Updated last month
ML-GSAI / Diffusion-LLM-Papers
View on GitHub
A Collection of Papers on Diffusion Language Models
☆180Sep 15, 2025Updated 10 months ago
NEUIR / Uncode
View on GitHub
[ACL '26] Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models"
☆44Jun 26, 2026Updated 3 weeks ago
OpenMOSS / DiRL
View on GitHub
☆165Mar 30, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xiaohangt / wd1
View on GitHub
Official Implementation of wd1
☆32Sep 25, 2025Updated 10 months ago
SJTU-DENG-Lab / Discrete-Diffusion-Forcing
View on GitHub
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
☆261Feb 3, 2026Updated 5 months ago
pengzhangzhi / Open-dLLM
View on GitHub
Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
☆645Updated this week
OpenMOSS / Sparse-dLLM
View on GitHub
☆29Oct 16, 2025Updated 9 months ago
zhangyitonggg / dllm4code
View on GitHub
Offical implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".
☆23Oct 29, 2025Updated 8 months ago
JinjieNi / MegaDLMs
View on GitHub
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…
☆343Nov 11, 2025Updated 8 months ago
OpenMOSS / LongLLaDA
View on GitHub
[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
☆55Dec 7, 2025Updated 7 months ago
chinsengi / dUltra-os
View on GitHub
dUltra: Ultra-Fast Diffusion Large Language Models via Reinforcement Learning
☆16Jul 11, 2026Updated 2 weeks ago
maomaocun / dLLM-cache
View on GitHub
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆211May 1, 2026Updated 2 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
PKU-ML / PAT
View on GitHub
Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"
☆22May 6, 2025Updated last year
OpenMOSS / rope_pp
View on GitHub
[ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
☆33Dec 9, 2025Updated 7 months ago
SJTU-DENG-Lab / LightningRL
View on GitHub
LightningRL: Breaking the Accuracy–Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning
☆30Apr 25, 2026Updated 3 months ago
yjyddq / EOSER-ASS-RL
View on GitHub
Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste…
☆28Mar 9, 2026Updated 4 months ago
tianyilt / TextCenGen_Background_Adapt
View on GitHub
TextCenGen introduces a dynamic adaptation of the blank region for text-friendly image generation, and enhances T2I model outcomes on arb…
☆38Jul 7, 2025Updated last year
jacklishufan / LaViDa
View on GitHub
Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding
☆227Dec 17, 2025Updated 7 months ago
Linxi000 / MEDS
View on GitHub
☆142Jun 24, 2026Updated last month
Gen-Verse / MMaDA
View on GitHub
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)
☆1,660Feb 14, 2026Updated 5 months ago
autonomousvision / mdpo
View on GitHub
MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models
☆45Jan 28, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ChenyuWang-Monica / REED
View on GitHub
Code for paper: "Learning Diffusion Models with Flexible Representation Guidance"
☆16Mar 18, 2026Updated 4 months ago
cnlinxi / LLM-paper-daily
View on GitHub
Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily
☆10Updated this week
DreamLM / Dream-Coder
View on GitHub
☆106Nov 17, 2025Updated 8 months ago
ML-GSAI / SMDM
View on GitHub
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆384Dec 22, 2024Updated last year
kuleshov-group / remdm
View on GitHub
Remasking Discrete Diffusion Models with Inference-Time Scaling
☆77Feb 7, 2026Updated 5 months ago
VMnK-Run / MARVEL
View on GitHub
[ASE2024] Mutual Learning-Based Framework for Enhancing Robustness of Code Models via Adversarial Training
☆11Sep 13, 2024Updated last year
PKU-Alignment / llms-resist-alignment
View on GitHub
[ACL2025 Best Paper] Language Models Resist Alignment
☆51Jun 11, 2025Updated last year
ZichenWen1 / DIJA
View on GitHub
(ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"
☆79Feb 9, 2026Updated 5 months ago
DreamLM / Dream
View on GitHub
Dream 7B, a large diffusion language model
☆1,254Nov 21, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Bai-YT / AdaptiveSmoothing
View on GitHub
Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".
☆10Feb 6, 2024Updated 2 years ago
microsoft / visualization-of-thought
View on GitHub
[NeurIPS 2024]Repos for "Visualization-of-Thought" dataset, construction code and evaluation.
☆37Oct 23, 2024Updated last year
sigsep / sigsep-mus-io
View on GitHub
Tools to convert sigsep mus dataset from STEMS <-> WAV
☆12Jul 15, 2020Updated 6 years ago
Zyriix / D2O
View on GitHub
Official implemention for Diffusion Models Are Innate One-Step Generators
☆27Jun 25, 2025Updated last year
horseee / dKV-Cache
View on GitHub
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆135May 22, 2025Updated last year
chr26195 / AP-MDM
View on GitHub
This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".
☆23Nov 17, 2025Updated 8 months ago
blairstar / NaturalDiffusion
View on GitHub
Official Code for "Rethinking Diffusion Model in High Dimension"
☆26May 20, 2025Updated last year