nus-apr / auto-code-rover
A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% of tasks (pass@1) in SWE-bench Lite and 46.2% of tasks (pass@1) in SWE-bench Verified, with each task costing less than $0.7.
☆2,793Updated last week
Alternatives and similar repositories for auto-code-rover:
Users who are interested in auto-code-rover are comparing it to the libraries listed below:
- [ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?☆2,288Updated this week
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,382Updated last month
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,281Updated 3 weeks ago
- ☆2,245Updated 9 months ago
- ✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.☆2,213Updated 8 months ago
- Large Action Model framework to develop AI Web Agents☆5,807Updated 2 months ago
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,185Updated 6 months ago
- Devon: An open-source pair programmer☆3,333Updated 4 months ago
- The Open Source Memory Layer For Autonomous Agents☆1,955Updated 2 months ago
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"☆3,720Updated last month
- SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensiv…☆14,188Updated this week
- A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,569Updated 4 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,528Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,775Updated 3 months ago
- Harness LLMs with Multi-Agent Programming☆2,917Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,442Updated 5 months ago
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,452Updated last month
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,783Updated 4 months ago
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,399Updated this week
- PraisonAI is an AI Agents Framework with Self Reflection. PraisonAI application combines PraisonAI Agents, AutoGen, and CrewAI into a low…☆3,009Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆2,908Updated this week
- The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a RESTful API.☆4,437Updated this week
- AIOS: AI Agent Operating System☆3,676Updated last week
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,580Updated 6 months ago
- AICI: Prompts as (Wasm) Programs☆1,979Updated 2 months ago
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C…☆2,619Updated this week
- The first open source Large Action Model generalist Artificial Narrow Intelligence agentic framework that controls completely human user …☆1,276Updated 6 months ago
- Deploy your agentic workflows to production☆1,915Updated this week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory.☆13,996Updated this week
- Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building age…☆1,064Updated last week