seal-rg / recurrent-pretraining
Pretraining code for a large-scale depth-recurrent language model
☆745 Updated last week
Alternatives and similar repositories for recurrent-pretraining:
Users who are interested in recurrent-pretraining are comparing it to the libraries listed below.
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,062 Updated 3 months ago
- ☆519 Updated last week
- Dream 7B, a large diffusion language model ☆572 Updated 2 weeks ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆882 Updated last week
- Recipes to scale inference-time compute of open models ☆1,058 Updated 2 months ago
- LIMO: Less is More for Reasoning ☆920 Updated 2 weeks ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. ☆325 Updated this week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆317 Updated 4 months ago
- Verifiers for LLM Reinforcement Learning ☆827 Updated 3 weeks ago
- Build your own visual reasoning model ☆341 Updated this week
- Large Reasoning Models ☆802 Updated 4 months ago
- procedural reasoning datasets ☆571 Updated this week
- Muon optimizer: +>30% sample efficiency with <3% wallclock overhead ☆577 Updated last month
- Official PyTorch implementation for "Large Language Diffusion Models" ☆1,520 Updated 2 weeks ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆862 Updated 2 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆385 Updated 2 weeks ago
- A bibliography and survey of the papers surrounding o1 ☆1,187 Updated 5 months ago
- ☆922 Updated 3 months ago
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models ☆551 Updated last week
- OLMoE: Open Mixture-of-Experts Language Models ☆716 Updated last month
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆304 Updated 5 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ☆1,171 Updated 2 weeks ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆1,462 Updated this week
- Muon is Scalable for LLM Training ☆1,029 Updated 3 weeks ago
- ☆647 Updated 3 weeks ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym ☆438 Updated 3 weeks ago
- Automatic evals for LLMs ☆373 Updated this week
- An Open Large Reasoning Model for Real-World Solutions ☆1,483 Updated last month
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆509 Updated last month
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling". ☆252 Updated 2 months ago