aorwall / moatless-tools
☆351Updated 2 weeks ago
Alternatives and similar repositories for moatless-tools:
Users that are interested in moatless-tools are comparing it to the libraries listed below
- ☆153Updated 5 months ago
- ☆83Updated 7 months ago
- AWM: Agent Workflow Memory☆242Updated 3 weeks ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆133Updated last month
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆208Updated 9 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆462Updated 11 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,455Updated last month
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"☆325Updated 3 weeks ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆358Updated last month
- Code and Data for Tau-Bench☆273Updated 3 weeks ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆146Updated 2 weeks ago
- LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step☆495Updated 5 months ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆600Updated 8 months ago
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…☆235Updated this week
- CodeRAG-Bench: Can Retrieval Augment Code Generation?☆109Updated 3 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆866Updated 2 weeks ago
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆730Updated 6 months ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆305Updated 5 months ago
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆369Updated last year
- 🌎💪 BrowserGym, a Gym environment for web task automation☆527Updated 3 weeks ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆116Updated 8 months ago
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆451Updated 2 weeks ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆336Updated 8 months ago
- ☆362Updated last month
- Code for the paper 🌳 Tree Search for Language Model Agents☆178Updated 6 months ago
- This Repo is the official implementation of AgentCoder and AgentCoder+.☆287Updated this week
- A framework-less approach to robust agent development.☆154Updated this week
- A codebase for "Language Models can Solve Computer Tasks"☆232Updated 9 months ago
- An agent benchmark with tasks in a simulated software company.☆247Updated this week
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI☆293Updated this week