SakanaAI / repoLinks
RePo: Language Models with Context Re-Positioning
☆64Updated last month
Alternatives and similar repositories for repo
Users that are interested in repo are comparing it to the libraries listed below
Sorting:
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…☆50Updated 2 weeks ago
- Resa: Transparent Reasoning Models via SAEs☆47Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆94Updated 2 weeks ago
- Official repo of paper LM2☆46Updated 11 months ago
- ☆42Updated 8 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Updated 5 months ago
- ☆29Updated 2 months ago
- Official Repository of Native Parallel Reasoner☆100Updated last week
- ☆19Updated 10 months ago
- ☆169Updated 4 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Updated 5 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆42Updated 9 months ago
- Official Repository for Task-Circuit Quantization☆24Updated 8 months ago
- ☆49Updated 4 months ago
- Official Project Page for Web World Models (https://arxiv.org/abs/2512.23676)☆80Updated 3 weeks ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated last year
- Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding☆193Updated 2 weeks ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Updated last month
- accompanying material for sleep-time compute paper☆119Updated 9 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆111Updated 2 months ago
- ☆66Updated this week
- SSRL: Self-Search Reinforcement Learning☆205Updated 5 months ago
- ☆21Updated 6 months ago
- ☆24Updated 9 months ago
- ☆35Updated 8 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆31Updated 7 months ago
- ☆67Updated 10 months ago
- ☆23Updated last year
- LIMI: Less is More for Agency☆159Updated 3 months ago