ZHZisZZ / dllm
dLLM: Simple Diffusion Language Modeling
☆1,069 Updated this week
Alternatives and similar repositories for dllm
Users that are interested in dllm are comparing it to the libraries listed below
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆360 Updated 11 months ago
- Dream 7B, a large diffusion language model ☆1,094 Updated 2 weeks ago
- Tina: Tiny Reasoning Models via LoRA ☆309 Updated 2 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆304 Updated last month
- Pretraining and inference code for a large-scale depth-recurrent language model ☆850 Updated last month
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025) ☆522 Updated 2 months ago
- An extension of the nanoGPT repository for training small MOE models. ☆215 Updated 8 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆561 Updated 2 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆352 Updated 5 months ago
- Official implementation of "Continuous Autoregressive Language Models" ☆646 Updated last week
- ☆463 Updated 3 months ago
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,367 Updated 3 months ago
- H-Net: Hierarchical Network with Dynamic Chunking ☆788 Updated 2 weeks ago
- PyTorch building blocks for the OLMo ecosystem ☆482 Updated this week
- rl from zero pretrain, can it be done? yes. ☆281 Updated 2 months ago
- ☆1,226 Updated 3 weeks ago
- ☆202 Updated 11 months ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference. ☆313 Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆479 Updated 3 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input ☆924 Updated 5 months ago
- ☆335 Updated last month
- Build your own visual reasoning model