aorwall / moatless-tree-search
☆63Updated last month
Alternatives and similar repositories for moatless-tree-search:
Users that are interested in moatless-tree-search are comparing it to the libraries listed below
- ☆73Updated last month
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆57Updated 4 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆48Updated 2 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆102Updated 3 months ago
- ☆40Updated last week
- Enhancing AI Software Engineering with Repository-level Code Graph☆132Updated last month
- ☆38Updated 6 months ago
- ☆54Updated 5 months ago
- ☆28Updated 3 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆145Updated 2 months ago
- ☆108Updated 3 weeks ago
- A benchmark that challenges language models to code solutions for scientific problems☆108Updated this week
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 5 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆73Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆46Updated this week
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated 10 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆104Updated 8 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated 9 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆46Updated last year
- ☆153Updated 5 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆49Updated this week
- ☆74Updated last year
- ☆50Updated 2 months ago
- Evaluating tool-augmented LLMs in conversation settings☆77Updated 8 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆13Updated last month
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆57Updated 5 months ago