aorwall / moatless-tree-search
☆79 Updated 3 weeks ago
Alternatives and similar repositories for moatless-tree-search:
Users who are interested in moatless-tree-search are comparing it to the repositories listed below.
- ☆85 Updated last week
- Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents ☆66 Updated 2 weeks ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆11 Updated 3 weeks ago
- ☆92 Updated 9 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆54 Updated 4 months ago
- r2e: turn any github repository into a programming agent environment ☆116 Updated 2 weeks ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆172 Updated 2 months ago
- CodeUltraFeedback: aligning large language models to coding preferences ☆71 Updated 10 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆159 Updated last month
- Systematic evaluation framework that automatically rates overthinking behavior in large language models. ☆88 Updated 3 weeks ago
- ☆114 Updated 2 months ago
- RepoQA: Evaluating Long-Context Code Understanding ☆108 Updated 6 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated last year
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆20 Updated last month
- ☆37 Updated 3 months ago
- ☆40 Updated 9 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI ☆108 Updated this week
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw ☆60 Updated 7 months ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback ☆64 Updated 8 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆122 Updated 10 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆54 Updated 7 months ago
- ☆31 Updated this week
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆35 Updated last week
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆48 Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆85 Updated last month
- Small, simple agent task environments for training and evaluation ☆18 Updated 6 months ago
- Harness used to benchmark aider against SWE Bench benchmarks ☆71 Updated 10 months ago
- Pre-training code for CrystalCoder 7B LLM ☆54 Updated 11 months ago
- A benchmark that challenges language models to code solutions for scientific problems ☆117 Updated this week
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆112 Updated last week