SamsungSAILMontreal / TinyRecursiveModelsLinks
☆5,085Updated 3 weeks ago
Alternatives and similar repositories for TinyRecursiveModels
Users that are interested in TinyRecursiveModels are comparing it to the libraries listed below
Sorting:
- Self-Adapting Language Models☆1,400Updated 2 months ago
- AlphaGo Moment for Model Architecture Discovery.☆1,101Updated 2 months ago
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,371Updated 2 weeks ago
- ☆695Updated 2 weeks ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆656Updated 3 weeks ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆584Updated 2 weeks ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆900Updated 4 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆733Updated this week
- Frontier Models playing the board game Diplomacy.☆597Updated last month
- Environments for LLM Reinforcement Learning☆3,391Updated this week
- On the Theoretical Limitations of Embedding-Based Retrieval☆584Updated last month
- Post-training with Tinker☆1,096Updated last week
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,705Updated 2 months ago
- Code for BLT research paper☆1,999Updated 5 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆479Updated last week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,159Updated 9 months ago
- Async RL Training at Scale☆722Updated this week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,202Updated 3 weeks ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆325Updated last year
- Pretraining and inference code for a large-scale depth-recurrent language model☆838Updated last week
- An interface library for RL post training with environments.☆66Updated last week
- Textbook on reinforcement learning from human feedback☆1,279Updated this week
- Tool for generating high quality Synthetic datasets☆1,346Updated this week
- ☆498Updated 5 months ago
- Open-source implementation of AlphaEvolve☆4,310Updated this week
- ☆516Updated 2 months ago
- ☆2,395Updated last week
- ☆1,986Updated last week
- (WIP) A small but powerful, homemade PyTorch from scratch.☆643Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space☆1,313Updated 2 months ago