SakanaAI / ab-mcts-arc2
☆105 Updated 5 months ago
Alternatives and similar repositories for ab-mcts-arc2
Users who are interested in ab-mcts-arc2 are comparing it to the repositories listed below
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆167 Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆225 Updated this week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆352 Updated 5 months ago
- accompanying material for sleep-time compute paper ☆118 Updated 7 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆110 Updated 7 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆112 Updated last month
- ☆76 Updated 2 months ago
- RLP: Reinforcement as a Pretraining Objective ☆205 Updated 2 months ago
- ☆85 Updated 5 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆100 Updated 3 months ago
- ☆88 Updated last month
- The official repository of ALE-Bench ☆134 Updated 2 weeks ago
- Train, tune, and infer Bamba model ☆136 Updated 6 months ago
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks" ☆55 Updated 4 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization ☆36 Updated 2 weeks ago
- ☆97 Updated this week
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods. ☆25 Updated 3 weeks ago
- Public repository containing METR's DVC pipeline for eval data analysis ☆140 Updated 8 months ago
- A Qwen 0.5B reasoning model trained on OpenR1-Math-220k ☆14 Updated last month
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from DeepMind ☆57 Updated 6 months ago
- Train your own SOTA deductive reasoning model ☆107 Updated 9 months ago
- ☆99 Updated 2 months ago
- ☆18 Updated 4 months ago
- ☆67 Updated 8 months ago
- Implementation of SOAR ☆43 Updated 2 months ago
- ☆19 Updated 9 months ago
- 🧬 The Huxley-Gödel Machine ☆305 Updated last week
- open source alpha evolve ☆67 Updated 6 months ago
- ☆137 Updated 2 months ago
- ☆79 Updated 2 months ago