AutoCodeRoverSG / auto-code-rover
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with each task costs less than $0.7.
☆2,847Updated this week
Alternatives and similar repositories for auto-code-rover:
Users that are interested in auto-code-rover are comparing it to the libraries listed below
- SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?☆2,555Updated this week
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,771Updated 3 months ago
- Devon: An open-source pair programmer☆3,378Updated 6 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,520Updated 2 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,666Updated 6 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,389Updated 2 months ago
- ✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.☆2,265Updated 10 months ago
- SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensiv…☆14,809Updated this week
- Harness LLMs with Multi-Agent Programming☆3,104Updated this week
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,651Updated this week
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C…☆2,899Updated this week
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,208Updated 8 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆2,975Updated last week
- Large Action Model framework to develop AI Web Agents☆5,916Updated last month
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,618Updated 7 months ago
- AICI: Prompts as (Wasm) Programs☆2,002Updated last month
- A code-first agent framework for seamlessly planning and executing data analytics tasks.☆5,553Updated 3 weeks ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,514Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,856Updated 5 months ago
- AIOS: AI Agent Operating System☆3,867Updated this week
- The #1 open-source voice interface for desktop, mobile, and ESP32 chips.☆5,037Updated 4 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆1,995Updated 4 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,909Updated last month
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,621Updated 5 months ago
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,686Updated last month
- A language model programming library.☆5,649Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,587Updated this week
- Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building age…☆1,110Updated last month
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user…☆1,288Updated 2 weeks ago
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆835Updated 7 months ago