cavaunpeu / mcts-llm-codegen

A Python reimplementation of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)

☆17

Alternatives and similar repositories for mcts-llm-codegen

Users who are interested in mcts-llm-codegen are comparing it to the repositories listed below.

scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27 Updated last year
sher222 / LeReT
Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
☆39 Updated 7 months ago
rmshin / llm-mcts
☆41 Updated last year
itl-ed / llm-dp
LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task
☆43 Updated 5 months ago
thomasgauthier / LLM-self-play
Minimal implementation of the "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" paper (arXiv 2401.01335)
☆28 Updated last year
VITA-Group / ChainCoder
[ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …
☆40 Updated last year
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆148 Updated 4 months ago
likenneth / q_probe
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
☆41 Updated last year
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆32 Updated 2 months ago
esteng / regal_program_learning
☆24 Updated 9 months ago
JoshuaPurtell / SmallBench
Small, simple agent task environments for training and evaluation
☆18 Updated 7 months ago
agential-ai / agential
🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
☆52 Updated this week
agiresearch / Formal-LLM
Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents
☆124 Updated last year
data-for-agents / insta
Official Repo for InSTA: Towards Internet-Scale Training For Agents
☆42 Updated this week
oashua / MathAgent
Code repo for MathAgent
☆16 Updated last year
YerbaPage / MGDebugger
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging
☆73 Updated 3 weeks ago
OSU-NLP-Group / reversal-curse-binding
☆23 Updated 2 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆57 Updated 9 months ago
ack-sec / toyberry
Toy implementation of Strawberry
☆33 Updated 9 months ago
gabegrand / lilo
LILO: Library Induction with Language Observations
☆87 Updated 9 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆63 Updated 2 months ago
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆76 Updated last year
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆95 Updated 2 weeks ago
du-nlp-lab / MLR-Copilot
☆65 Updated 2 months ago
benpry / why-think-step-by-step
Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"
☆60 Updated 2 months ago
portal-cornell / muCode
☆19 Updated 3 months ago
austrian-code-wizard / c3po
☆27 Updated this week
joshuacnf / Ctrl-G
☆86 Updated 5 months ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48 Updated last year
nyu-mll / ILF-for-code-generation
☆76 Updated 3 months ago