SakanaAI / ab-mcts-arc2Links
☆88Updated last month
Alternatives and similar repositories for ab-mcts-arc2
Users that are interested in ab-mcts-arc2 are comparing it to the libraries listed below
Sorting:
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆316Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR.☆99Updated 3 months ago
- The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆84Updated 2 weeks ago
- accompanying material for sleep-time compute paper☆99Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆78Updated last week
- A coding agent framework, that works on its own codebase.☆47Updated 3 months ago
- ☆66Updated 4 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆61Updated 3 weeks ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆56Updated last month
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆45Updated last month
- ☆54Updated last month
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆56Updated 2 months ago
- ☆73Updated 5 months ago
- ☆88Updated last month
- Multi-Granularity LLM Debugger☆87Updated 3 weeks ago
- LLM reads a paper and produce a working prototype☆58Updated 3 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆93Updated this week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆72Updated 4 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆226Updated 5 months ago
- ☆41Updated last year
- Train your own SOTA deductive reasoning model☆103Updated 4 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆287Updated last month
- [ACL 2025] Agentic Knowledgeable Self-awareness☆77Updated last month
- Repository to create traveling waves integrate special information through time☆53Updated 4 months ago
- The official repository of ALE-Bench☆107Updated 2 weeks ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated 3 months ago
- Code for ExploreTom☆84Updated last month
- ☆53Updated 5 months ago
- open source alpha evolve☆66Updated 2 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆99Updated last week