SakanaAI / ab-mcts-arc2Links
☆93Updated last month
Alternatives and similar repositories for ab-mcts-arc2
Users that are interested in ab-mcts-arc2 are comparing it to the libraries listed below
Sorting:
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆329Updated 2 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆101Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆88Updated this week
- accompanying material for sleep-time compute paper☆105Updated 3 months ago
- The official repository of ALE-Bench☆110Updated last week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆97Updated 3 weeks ago
- A coding agent framework, that works on its own codebase.☆50Updated 4 months ago
- Train, tune, and infer Bamba model☆131Updated 2 months ago
- ☆53Updated 6 months ago
- ☆74Updated 6 months ago
- The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆94Updated last month
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆56Updated 2 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆73Updated 5 months ago
- LLM reads a paper and produce a working prototype☆57Updated 4 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆444Updated 3 weeks ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆56Updated 2 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆61Updated last month
- ☆264Updated 4 months ago
- Implementation of SOAR☆41Updated 3 weeks ago
- Train your own SOTA deductive reasoning model☆104Updated 5 months ago
- ☆89Updated 2 months ago
- Train transformer language models with reinforcement learning.☆19Updated 6 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆229Updated 6 months ago
- ☆15Updated last month
- Collection of LLM completions for reasoning-gym task datasets☆28Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆73Updated 8 months ago
- ☆54Updated 9 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆398Updated last week
- ☆55Updated last month
- ☆130Updated 5 months ago