SakanaAI / ab-mcts-arc2Links
☆95Updated 2 months ago
Alternatives and similar repositories for ab-mcts-arc2
Users that are interested in ab-mcts-arc2 are comparing it to the libraries listed below
Sorting:
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆150Updated 3 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆339Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆92Updated this week
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 4 months ago
- LLM reads a paper and produce a working prototype☆57Updated 5 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆96Updated last month
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆96Updated 2 weeks ago
- accompanying material for sleep-time compute paper☆108Updated 4 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆172Updated 3 weeks ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆51Updated last month
- A coding agent framework, that works on its own codebase.☆63Updated 4 months ago
- Train your own SOTA deductive reasoning model☆106Updated 6 months ago
- An Automatic Prompt Optimization Framework for Large Language Models☆117Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 9 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆230Updated 6 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆61Updated 2 months ago
- Implementation of SOAR☆42Updated last month
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆370Updated 2 weeks ago
- ☆56Updated 2 months ago
- ☆25Updated 3 months ago
- ☆83Updated last month
- The official repository of ALE-Bench☆112Updated this week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 7 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆58Updated 2 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆32Updated 2 weeks ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆24Updated last week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 5 months ago
- Train, tune, and infer Bamba model☆131Updated 3 months ago
- ☆78Updated 3 weeks ago
- Source code and utilities for the Genesys distributed language model architecture discovery system.☆49Updated 2 months ago