HKUNLP / DreamLinks

Dream 7B, a large diffusion language model

☆816

Alternatives and similar repositories for Dream

Users that are interested in Dream are comparing it to the libraries listed below

Sorting:

kuleshov-group / bd3lms
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆726Updated this week
seal-rg / recurrent-pretraining
Pretraining code for a large-scale depth-recurrent language model
☆793Updated last month
Gen-Verse / MMaDA
MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
☆1,189Updated last month
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,093Updated 3 months ago
ML-GSAI / LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
☆2,530Updated 3 weeks ago
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,023Updated 2 weeks ago
dllm-reasoning / d1
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆235Updated 2 weeks ago
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆412Updated last month
facebookresearch / coconut
Training Large Language Model to Reason in a Continuous Latent Space
☆1,185Updated 5 months ago
KellerJordan / Muon
Muon is an optimizer for hidden layers in neural networks
☆1,092Updated this week
PRIME-RL / TTRL
TTRL: Test-Time Reinforcement Learning
☆704Updated 2 weeks ago
ML-GSAI / SMDM
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆248Updated 6 months ago
ByteDance-Seed / Seed-Thinking-v1.5
☆800Updated last month
McGill-NLP / nano-aha-moment
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆497Updated last week
microsoft / rStar
☆585Updated 3 months ago
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆341Updated 7 months ago
ChenxinAn-fdu / POLARIS
Scaling RL on advanced reasoning models
☆392Updated this week
HKUNLP / DiffuLLaMA
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
☆234Updated last month
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆809Updated 4 months ago
shangshang-wang / Tina
Tina: Tiny Reasoning Models via LoRA
☆266Updated last month
MoonshotAI / Kimi-VL
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
☆956Updated 3 weeks ago
GAIR-NLP / LIMO
[COLM 2025] LIMO: Less is More for Reasoning
☆977Updated last week
Haiyang-W / TokenFormer
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
☆563Updated 5 months ago
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆805Updated 7 months ago
BytedTsinghua-SIA / DAPO
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,421Updated 2 months ago
GAIR-NLP / anole
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
☆774Updated last month
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆404Updated this week
Gen-Verse / ReasonFlux
ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling
☆447Updated last week
ypwang61 / One-Shot-RLVR
official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”
☆323Updated this week
turningpoint-ai / VisualThinker-R1-Zero
Explore the Multimodal “Aha Moment” on 2B Model
☆596Updated 3 months ago