cavaunpeu / mcts-llm-codegen
A Python reimplementation of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)
☆18 Updated last year
Alternatives and similar repositories for mcts-llm-codegen
Users who are interested in mcts-llm-codegen compare it with the repositories listed below.
- ☆47 Updated last year
- Minimal implementation of the paper "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" (arXiv:2401.01335) ☆29 Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025) ☆72 Updated last year
- ☆144 Updated last year
- ☆41 Updated last year
- ☆21 Updated 3 months ago
- Accepted by Transactions on Machine Learning Research (TMLR) ☆131 Updated 11 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆41 Updated last year
- ☆59 Updated last year
- This repository contains an LLM benchmark for the social deduction game "Resistance Avalon" ☆126 Updated 3 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆151 Updated 7 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents ☆110 Updated 10 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience" ☆61 Updated 5 months ago
- Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation ☆46 Updated last year
- ☆23 Updated 5 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples ☆106 Updated 2 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆33 Updated 5 months ago
- LILO: Library Induction with Language Observations ☆88 Updated last year
- ☆119 Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆128 Updated last year
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆55 Updated 2 months ago
- A hard gym for programming ☆160 Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆97 Updated last year
- A repository for transformer critique learning and generation ☆90 Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4" ☆43 Updated 11 months ago
- ☆78 Updated 6 months ago
- Based on the tree of thoughts paper ☆48 Updated 2 years ago
- Code repo for MathAgent ☆17 Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 Updated 2 years ago
- ☆21 Updated last year