s-sahoo/Eso-LMs

s-sahoo / Eso-LMs

[ICML 2026] Esoteric Language Models

☆122

Alternatives and similar repositories for Eso-LMs

Users who are interested in Eso-LMs are comparing it to the repositories listed below.

kuleshov-group / remdm
Remasking Discrete Diffusion Models with Inference-Time Scaling
☆77 · Feb 7, 2026 · Updated 5 months ago
gabeguo / any-order-speculative-decoding
Reviving Any-Order Autoregressive Models via Principled Parallel Sampling and Speculative Decoding
☆16 · Nov 16, 2025 · Updated 8 months ago
s-sahoo / scaling-dllms
[ICML 2026] Scaling Beyond Masked Diffusion Language Models
☆31 · Jul 3, 2026 · Updated 3 weeks ago
kuleshov-group / mdlm
[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model
☆703 · Sep 29, 2025 · Updated 9 months ago
SeunggeunKimkr / PRISM
[ICML 2026] Public repository for fine-tuning Masked Diffusion Models toward provable self-correction.
☆26 · Jul 5, 2026 · Updated 3 weeks ago
diffusion-llms / awesome-discrete-diffusion-models
☆17 · Oct 27, 2025 · Updated 9 months ago
s-sahoo / duo
[ICML 2025] The Diffusion Duality
☆236 · Jun 6, 2026 · Updated last month
chen-hao-chao / mdm-prime
[NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking
☆32 · Jun 15, 2026 · Updated last month
kuleshov-group / bd3lms
[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆1,025 · Jul 10, 2025 · Updated last year
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆453 · Jan 26, 2026 · Updated 6 months ago
zhouc20 / HDLM
Official Repository for NeurIPS 2025 Paper: Next Semantic Scale Prediction via Hierarchical Diffusion Language Models
☆35 · Oct 13, 2025 · Updated 9 months ago
kuleshov-group / setdlms
[ICML 2026] Set Diffusion: Interpolating Token Orderings between Autoregression and Diffusion for Fast and Flexible Decoding
☆22 · Jul 20, 2026 · Updated last week
ML-GSAI / SMDM
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆385 · Dec 22, 2024 · Updated last year
chinsengi / dUltra-os
dUltra: Ultra-Fast Diffusion Large Language Models via Reinforcement Learning
☆16 · Jul 11, 2026 · Updated 2 weeks ago
hamishivi / tess-2
Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"
☆58 · Feb 20, 2025 · Updated last year
chen-hao-chao / mdm-prime-v2
MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Scaling of Diffusion Language Models
☆27 · May 23, 2026 · Updated 2 months ago
horseee / dKV-Cache
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆135 · May 22, 2025 · Updated last year
kuleshov-group / awesome-discrete-diffusion-models
A curated list for awesome discrete diffusion models resources.
☆571 · Sep 9, 2025 · Updated 10 months ago
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆401 · May 31, 2025 · Updated last year
maple-research-lab / LLaDOU
Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]
☆82 · Dec 17, 2025 · Updated 7 months ago
yu-rp / Dimple
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆117 · Jul 9, 2025 · Updated last year
ML-GSAI / RADD
Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…
☆84 · May 30, 2025 · Updated last year
NVlabs / Fast-dLLM
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆1,064 · May 30, 2026 · Updated last month
GBATZOLIS / BitstreamDiffusion
☆15 · Jul 22, 2026 · Updated last week
zaydzuhri / token-order-prediction
Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
☆48 · May 13, 2026 · Updated 2 months ago
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆50 · Sep 23, 2025 · Updated 10 months ago
JinjieNi / Quokka
The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…
☆46 · Nov 6, 2025 · Updated 8 months ago
pengzhangzhi / Open-dLLM
Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
☆643 · Jul 20, 2026 · Updated last week
maomaocun / dLLM-cache
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆212 · May 1, 2026 · Updated 2 months ago
JinjieNi / dlms-are-super-data-learners
The official github repo for "Diffusion Language Models are Super Data Learners".
☆227 · Nov 6, 2025 · Updated 8 months ago
TIGER-AI-Lab / One-Shot-CFT
The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]
☆33 · Sep 1, 2025 · Updated 10 months ago
dvruette / gidd-easydel
☆25 · Dec 16, 2025 · Updated 7 months ago
kuleshov-group / e2d2
[NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
☆47 · Oct 29, 2025 · Updated 8 months ago
locuslab / EqR
[ICML 2026] Code for Equilibrium Reasoners: learning attractor dynamics for scalable reasoning
☆45 · Jun 1, 2026 · Updated last month
inclusionAI / dFactory
Easy and Efficient dLLM Fine-Tuning
☆261 · Mar 2, 2026 · Updated 4 months ago
OpenMOSS / LongLLaDA
[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
☆55 · Dec 7, 2025 · Updated 7 months ago
OpenGVLab / SDLM
Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…
☆98 · Dec 27, 2025 · Updated 7 months ago
DreamLM / Dream
Dream 7B, a large diffusion language model
☆1,256 · Nov 21, 2025 · Updated 8 months ago
Farseer-Scaling-Law / Farseer
☆21 · Jun 12, 2025 · Updated last year