SakanaAI / ab-mcts-arc2Links

☆88

Alternatives and similar repositories for ab-mcts-arc2

Users that are interested in ab-mcts-arc2 are comparing it to the libraries listed below

Sorting:

SakanaAI / RLT
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
☆316Updated last month
facebookresearch / collaborative-reasoner
Source code for the collaborative reasoner research project at Meta FAIR.
☆99Updated 3 months ago
StigLidu / DualDistill
The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"
☆84Updated 2 weeks ago
letta-ai / sleep-time-compute
accompanying material for sleep-time compute paper
☆99Updated 3 months ago
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆78Updated last week
MaximeRobeyns / self_improving_coding_agent
A coding agent framework, that works on its own codebase.
☆47Updated 3 months ago
du-nlp-lab / MLR-Copilot
☆66Updated 4 months ago
zou-group / sirius
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning
☆61Updated 3 weeks ago
kagnlp / CodeGenerator
This repository contains popular code generation frameworks such as MapCoder, CodeSIM.
☆56Updated last month
yale-nlp / SciArena
Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"
☆45Updated last month
yueqis / API-Based-Agent
☆54Updated last month
lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆56Updated 2 months ago
LLMSELECTOR / LLMSELECTOR
☆73Updated 5 months ago
gkamradt / SnakeBench
☆88Updated last month
YerbaPage / MGDebugger
Multi-Granularity LLM Debugger
☆87Updated 3 weeks ago
phunterlau / paper_without_code
LLM reads a paper and produce a working prototype
☆58Updated 3 months ago
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆93Updated this week
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆72Updated 4 months ago
lamm-mit / PRefLexOR
Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning
☆226Updated 5 months ago
charlesjin / emergent-semantics
☆41Updated last year
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆103Updated 4 months ago
allenai / codescientist
CodeScientist: An automated scientific discovery system for code-based experiments
☆287Updated last month
zjunlp / KnowSelf
[ACL 2025] Agentic Knowledgeable Self-awareness
☆77Updated last month
KempnerInstitute / traveling-waves-integrate
Repository to create traveling waves integrate special information through time
☆53Updated 4 months ago
SakanaAI / ALE-Bench
The official repository of ALE-Bench
☆107Updated 2 weeks ago
goncalorafaria / qalign
QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.
☆23Updated 3 months ago
facebookresearch / ExploreToM
Code for ExploreTom
☆84Updated last month
MrYxJ / InfiniRetri
☆53Updated 5 months ago
Think-a-Tron / evolve
open source alpha evolve
☆66Updated 2 months ago
huggingface / screensuite
ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!
☆99Updated last week