seal-rg / recurrent-pretraining
Pretraining code for a large-scale depth-recurrent language model
☆783 Updated 2 weeks ago
Alternatives and similar repositories for recurrent-pretraining
Users who are interested in recurrent-pretraining are comparing it to the libraries listed below.
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,162 Updated 5 months ago
- Dream 7B, a large diffusion language model ☆774 Updated last week
- procedural reasoning datasets ☆872 Updated this week
- Recipes to scale inference-time compute of open models ☆1,097 Updated last month
- Understanding R1-Zero-Like Training: A Critical Perspective ☆991 Updated last month
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆383 Updated 2 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆339 Updated 6 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆315 Updated 7 months ago
- Muon is Scalable for LLM Training ☆1,081 Updated 2 months ago
- LIMO: Less is More for Reasoning ☆963 Updated 2 months ago
- Code for BLT research paper ☆1,686 Updated last month
- ☆782 Updated last month
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models ☆703 Updated 2 months ago
- Build your own visual reasoning model ☆385 Updated last week
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆476 Updated last month
- Tina: Tiny Reasoning Models via LoRA ☆260 Updated 3 weeks ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,106 Updated 4 months ago
- ☆1,025 Updated 6 months ago
- ☆939 Updated 5 months ago
- ☆570 Updated 2 months ago
- A bibliography and survey of the papers surrounding o1 ☆1,201 Updated 7 months ago
- OLMoE: Open Mixture-of-Experts Language Models ☆788 Updated 3 months ago
- Verifiers for LLM Reinforcement Learning ☆1,328 Updated this week
- TTRL: Test-Time Reinforcement Learning ☆650 Updated 3 weeks ago
- Code for the paper: "Learning to Reason without External Rewards" ☆306 Updated last week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆551 Updated 3 months ago
- Scalable RL solution for advanced reasoning of language models ☆1,622 Updated 3 months ago
- A project to improve skills of large language models ☆429 Updated this week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆489 Updated last month
- Muon: An optimizer for hidden layers in neural networks ☆897 Updated 2 weeks ago