ZHZisZZ / dllmLinks
dLLM: Simple Diffusion Language Modeling
☆1,566Updated last week
Alternatives and similar repositories for dllm
Users that are interested in dllm are comparing it to the libraries listed below
Sorting:
- Dream 7B, a large diffusion language model☆1,139Updated last month
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)☆530Updated 3 months ago
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…☆550Updated last week
- Official implementation of "Continuous Autoregressive Language Models"☆686Updated last month
- [ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆940Updated 6 months ago
- ☆1,268Updated 2 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆859Updated 2 weeks ago
- H-Net: Hierarchical Network with Dynamic Chunking☆801Updated last month
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆576Updated 3 months ago
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆547Updated last week
- Open-source release accompanying Gao et al. 2025☆490Updated last month
- Tina: Tiny Reasoning Models via LoRA☆314Updated 3 months ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,449Updated 5 months ago
- ☆371Updated 2 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆370Updated last year
- PyTorch building blocks for the OLMo ecosystem☆681Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆357Updated 6 months ago
- ☆949Updated 2 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆936Updated 7 months ago
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆573Updated 3 weeks ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆725Updated 3 weeks ago
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…☆312Updated 2 months ago
- ☆465Updated 4 months ago
- An interface library for RL post training with environments.☆1,004Updated this week
- ☆204Updated last year
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆356Updated last year
- An extension of the nanoGPT repository for training small MOE models.☆225Updated 10 months ago
- A project to improve skills of large language models☆756Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆614Updated last week
- Scalable toolkit for efficient model reinforcement☆1,227Updated this week