HKUNLP / Dream
Dream 7B, a large diffusion language model
☆622Updated last week
Alternatives and similar repositories for Dream:
Users that are interested in Dream are comparing it to the libraries listed below
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆569Updated 3 weeks ago
- Pretraining code for a large-scale depth-recurrent language model☆755Updated 3 weeks ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆1,556Updated this week
- Understanding R1-Zero-Like Training: A Critical Perspective☆908Updated 3 weeks ago
- Muon is Scalable for LLM Training☆1,039Updated last month
- ☆739Updated 2 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆322Updated 4 months ago
- ☆268Updated this week
- TTRL: Test-Time Reinforcement Learning☆407Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space☆1,094Updated 3 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆436Updated last month
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆165Updated 4 months ago
- ☆524Updated 3 weeks ago
- Muon optimizer: +>30% sample efficiency with <3% wallclock overhead☆611Updated last month
- Large Reasoning Models☆804Updated 5 months ago
- Explore the Multimodal “Aha Moment” on 2B Model☆583Updated last month
- Recipes to scale inference-time compute of open models☆1,066Updated 2 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆338Updated this week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,198Updated last month
- An Open Large Reasoning Model for Real-World Solutions☆1,488Updated 2 months ago
- Scalable RL solution for advanced reasoning of language models☆1,529Updated last month
- ☆671Updated last week
- OLMoE: Open Mixture-of-Experts Language Models☆739Updated last month
- Official Repo for Open-Reasoner-Zero☆1,904Updated last month
- LIMO: Less is More for Reasoning☆927Updated last month
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆127Updated this week
- Build your own visual reasoning model☆357Updated this week
- Atom of Thoughts for Markov LLM Test-Time Scaling☆560Updated last week
- Tina: Tiny Reasoning Models via LoRA☆164Updated 2 weeks ago
- Code for BLT research paper☆1,558Updated last week