SakanaAI / ab-mcts-arc2Links
☆67Updated 2 weeks ago
Alternatives and similar repositories for ab-mcts-arc2
Users that are interested in ab-mcts-arc2 are comparing it to the libraries listed below
Sorting:
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆55Updated last month
- Very minimal (and stateless) agent framework☆44Updated 6 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆295Updated 3 weeks ago
- Train, tune, and infer Bamba model☆130Updated last month
- LLM reads a paper and produce a working prototype☆58Updated 3 months ago
- ☆66Updated 3 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆87Updated last week
- Simple repository for training small reasoning models☆33Updated 5 months ago
- accompanying material for sleep-time compute paper☆97Updated 2 months ago
- ☆40Updated 6 months ago
- Repository to create traveling waves integrate special information through time☆53Updated 4 months ago
- ☆23Updated 3 weeks ago
- ☆22Updated last month
- The official repository of ALE-Bench☆98Updated this week
- ☆37Updated last week
- ☆53Updated 4 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 8 months ago
- Official Code Release for "Training a Generally Curious Agent"☆26Updated last month
- ☆24Updated 3 weeks ago
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆63Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆94Updated 2 months ago
- ☆55Updated 2 weeks ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated 3 months ago
- ☆41Updated 11 months ago
- ☆71Updated 4 months ago
- Code for LitLLMs, LLMs for Literature Review: Are we there yet? (TMLR 2025)☆33Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆73Updated last week
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- ☆69Updated last month
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆60Updated 3 months ago