SakanaAI / ab-mcts-arc2Links
☆106Updated 6 months ago
Alternatives and similar repositories for ab-mcts-arc2
Users that are interested in ab-mcts-arc2 are comparing it to the libraries listed below
Sorting:
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆256Updated this week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆357Updated 6 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- accompanying material for sleep-time compute paper☆118Updated 8 months ago
- ☆92Updated 2 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆102Updated 4 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag …☆126Updated 3 months ago
- Implementation of SOAR☆45Updated 4 months ago
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆121Updated last week
- RLP: Reinforcement as a Pretraining Objective☆223Updated 3 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 5 months ago
- Train your own SOTA deductive reasoning model☆107Updated 10 months ago
- ☆67Updated 9 months ago
- open source alpha evolve☆68Updated 7 months ago
- ☆226Updated 10 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆87Updated 9 months ago
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆297Updated 2 weeks ago
- Train, tune, and infer Bamba model☆137Updated 7 months ago
- Codebase from our first release.☆39Updated last week
- ☆20Updated 5 months ago
- ☆53Updated 11 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆512Updated last month
- ☆114Updated 3 months ago
- ☆97Updated last month
- ☆86Updated 6 months ago
- ☆81Updated 3 months ago
- The official repository of ALE-Bench☆149Updated last week
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆283Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year