HKUNLP / Dream
Dream 7B, a large diffusion language model
☆551Updated last week
Alternatives and similar repositories for Dream:
Users that are interested in Dream are comparing it to the libraries listed below
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆536Updated this week
- Official PyTorch implementation for "Large Language Diffusion Models"☆1,492Updated last week
- Pretraining code for a large-scale depth-recurrent language model☆743Updated this week
- Muon is Scalable for LLM Training☆1,022Updated 3 weeks ago
- ☆662Updated this week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆314Updated 4 months ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,062Updated 2 months ago
- Explore the Multimodal “Aha Moment” on 2B Model☆572Updated last month
- ☆518Updated this week
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆248Updated 2 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆863Updated this week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,141Updated last week
- Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"☆154Updated 3 months ago
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities☆685Updated this week
- Large Reasoning Models☆802Updated 4 months ago
- Build your own visual reasoning model☆338Updated this week
- A fork to add multimodal model training to open-r1☆1,212Updated 2 months ago
- Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training☆263Updated last month
- Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation☆750Updated 8 months ago
- Scalable RL solution for advanced reasoning of language models☆1,488Updated last month
- Muon optimizer: +>30% sample efficiency with <3% wallclock overhead☆575Updated 3 weeks ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆320Updated this week
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆154Updated last month
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆503Updated last month
- ☆630Updated 2 weeks ago
- MineWorld: A Real-time interactive world model on Minecraft☆162Updated this week
- ☆542Updated 2 weeks ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆1,382Updated this week
- LIMO: Less is More for Reasoning☆913Updated 2 weeks ago
- Rethinking Step-by-step Visual Reasoning in LLMs☆287Updated 2 months ago