SalesforceAIResearch / CodeTree
Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
☆20Updated last month
Alternatives and similar repositories for CodeTree:
Users that are interested in CodeTree are comparing it to the libraries listed below
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆50Updated 2 months ago
- ☆26Updated 3 months ago
- ☆63Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆25Updated 4 months ago
- ☆79Updated 2 weeks ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated last month
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆88Updated 3 weeks ago
- ☆24Updated 7 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆85Updated last month
- ☆50Updated 5 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 8 months ago
- ☆40Updated 9 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆35Updated last week
- ☆85Updated last week
- ☆42Updated last month
- Knowledge Unlearning for Large Language Models☆25Updated this week
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 3 months ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 5 months ago
- ☆41Updated 4 months ago
- ☆15Updated 3 weeks ago
- Aioli: A unified optimization framework for language model data mixing☆25Updated 3 months ago
- ☆48Updated 5 months ago
- ☆27Updated 2 weeks ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- ☆36Updated 10 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆96Updated 6 months ago
- ☆24Updated this week
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆33Updated 6 months ago