seal-rg / recurrent-pretraining

Pretraining and inference code for a large-scale depth-recurrent language model

☆808

Alternatives and similar repositories for recurrent-pretraining

Users interested in recurrent-pretraining often compare it to the repositories listed below.

facebookresearch / coconut
Training Large Language Models to Reason in a Continuous Latent Space
☆1,224 · Updated 6 months ago
DreamLM / Dream
Dream 7B, a large diffusion language model
☆873 · Updated last month
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,110 · Updated 2 months ago
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆344 · Updated 7 months ago
open-thought / reasoning-gym
Procedural reasoning datasets
☆1,012 · Updated this week
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆321 · Updated 8 months ago
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆830 · Updated 4 months ago
microsoft / rStar
☆608 · Updated 3 weeks ago
GAIR-NLP / LIMO
[COLM 2025] LIMO: Less is More for Reasoning
☆993 · Updated last week
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,055 · Updated last week
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,132 · Updated 6 months ago
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆425 · Updated last week
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆901 · Updated 3 months ago
groundlight / r1_vlm
Build your own visual reasoning model
☆401 · Updated this week
McGill-NLP / nano-aha-moment
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆512 · Updated 3 weeks ago
trotsky1997 / MathBlackBox
☆1,028 · Updated 7 months ago
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,240 · Updated this week
facebookresearch / MLGym
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
☆538 · Updated 2 weeks ago
facebookresearch / blt
Code for BLT research paper
☆1,760 · Updated 2 months ago
KellerJordan / Muon
Muon is an optimizer for hidden layers in neural networks
☆1,390 · Updated 3 weeks ago
ezelikman / quiet-star
Code for Quiet-STaR
☆737 · Updated 11 months ago
PrimeIntellect-ai / prime-rl
Decentralized RL Training at Scale
☆400 · Updated this week
SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆516 · Updated last week
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆679 · Updated last week
shangshang-wang / Tina
Tina: Tiny Reasoning Models via LoRA
☆274 · Updated 2 months ago
srush / awesome-o1
A bibliography and survey of the papers surrounding o1
☆1,209 · Updated 8 months ago
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆804 · Updated 8 months ago
Haiyang-W / TokenFormer
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
☆567 · Updated 5 months ago
sunblaze-ucb / Intuitor
Code for the paper: "Learning to Reason without External Rewards"
☆344 · Updated 3 weeks ago
kuleshov-group / bd3lms
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆749 · Updated 3 weeks ago