SakanaAI / ab-mcts-arc2Links
☆103Updated 3 months ago
Alternatives and similar repositories for ab-mcts-arc2
Users that are interested in ab-mcts-arc2 are comparing it to the libraries listed below
Sorting:
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆161Updated 2 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆346Updated 4 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 6 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆97Updated this week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆103Updated 2 weeks ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 4 months ago
- RLP: Reinforcement as a Pretraining Objective☆192Updated 3 weeks ago
- ☆92Updated 3 weeks ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆53Updated 2 months ago
- accompanying material for sleep-time compute paper☆117Updated 5 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆27Updated last week
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆63Updated 4 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆584Updated last week
- ☆84Updated 3 months ago
- ☆58Updated 4 months ago
- ☆93Updated 4 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated last month
- Train your own SOTA deductive reasoning model☆108Updated 7 months ago
- open source alpha evolve☆66Updated 5 months ago
- ☆83Updated 2 months ago
- ☆18Updated 3 months ago
- The official repository of ALE-Bench☆120Updated 2 weeks ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆232Updated 8 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆225Updated last week
- Code for ExploreTom☆86Updated 4 months ago
- Implementation of SOAR☆42Updated last month
- Train, tune, and infer Bamba model☆135Updated 4 months ago
- ☆25Updated 5 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆479Updated last week
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 10 months ago