SalesforceAIResearch / CodeTreeLinks

Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models

☆24

Alternatives and similar repositories for CodeTree

Users that are interested in CodeTree are comparing it to the libraries listed below

Sorting:

bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆68Updated 3 months ago
aorwall / moatless-tree-search
☆99Updated last month
AlexCuadron / ThinkingAgent
Systematic evaluation framework that automatically rates overthinking behavior in large language models.
☆91Updated 2 months ago
du-nlp-lab / MLR-Copilot
☆66Updated 4 months ago
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆36Updated last year
oriyor / assistantbench
Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"
☆59Updated 7 months ago
arcee-ai / DAM
☆53Updated 8 months ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27Updated 6 months ago
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
data-for-agents / insta
Official Repo for InSTA: Towards Internet-Scale Training For Agents
☆52Updated 3 weeks ago
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆99Updated last month
asappresearch / webagents-step
☆41Updated last year
letta-ai / sleep-time-compute
accompanying material for sleep-time compute paper
☆99Updated 3 months ago
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42Updated last year
MLE-Dojo / MLE-Dojo
☆61Updated last week
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29Updated 6 months ago
SalesforceAIResearch / LaTRO
☆118Updated 5 months ago
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆175Updated 4 months ago
InternLM / SWE-Fixer
☆108Updated 2 months ago
dinobby / MAgICoRE
☆24Updated 10 months ago
SalesforceAIResearch / swecomm
☆27Updated 6 months ago
yueqis / API-Based-Agent
☆54Updated last month
schauppi / Self-Rewarding-Language-Models
☆46Updated last year
google-deepmind / llms_can_learn_rules
☆58Updated 7 months ago
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆32Updated 3 months ago
sher222 / LeReT
Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
☆49Updated 9 months ago
austrian-code-wizard / c3po
☆29Updated this week
JHU-CLSP / RATIONALYST
Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
☆33Updated 10 months ago
zjunlp / KnowSelf
[ACL 2025] Agentic Knowledgeable Self-awareness
☆77Updated last month