SakanaAI / ab-mcts-arc2Links
☆98Updated 3 months ago
Alternatives and similar repositories for ab-mcts-arc2
Users that are interested in ab-mcts-arc2 are comparing it to the libraries listed below
Sorting:
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆159Updated last month
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆478Updated last week
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 5 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆99Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆94Updated last week
- The official repository of ALE-Bench☆117Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆343Updated 3 months ago
- accompanying material for sleep-time compute paper☆115Updated 5 months ago
- AIRA-dojo: a framework for developing and evaluating AI research agents☆95Updated last week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆475Updated 2 months ago
- LLM reads a paper and produce a working prototype☆56Updated 5 months ago
- RLP: Reinforcement as a Pretraining Objective☆69Updated last week
- A coding agent framework, that works on its own codebase.☆122Updated 5 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆52Updated 2 months ago
- open source alpha evolve☆67Updated 4 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆61Updated 6 months ago
- Implementation of SOAR☆43Updated 2 weeks ago
- ☆86Updated this week
- An Automatic Prompt Optimization Framework for Large Language Models☆122Updated 2 months ago
- Train your own SOTA deductive reasoning model☆107Updated 7 months ago
- ☆93Updated 3 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆294Updated 3 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 4 months ago
- ☆70Updated this week
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆24Updated 2 weeks ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated last month
- Train, tune, and infer Bamba model☆133Updated 4 months ago
- ☆85Updated 3 months ago
- ☆84Updated last month
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆112Updated 2 months ago