nus-apr / auto-code-rover

A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% of tasks (pass@1) in SWE-bench lite and 46.2% of tasks (pass@1) in SWE-bench verified, with each task costing less than $0.7.

☆2,793

Alternatives and similar repositories for auto-code-rover:

Users who are interested in auto-code-rover are comparing it to the libraries listed below.

swe-bench / SWE-bench
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
☆2,288 Updated this week
McGill-NLP / webllama
Llama-3 agents that can browse the web by following instructions and talking to you
☆1,382 Updated last month
OpenAutoCoder / Agentless
Agentless🐱: an agentless approach to automatically solve software development problems
☆1,281 Updated 3 weeks ago
mshumer / gpt-investor
☆2,245 Updated 9 months ago
semanser / codel
✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.
☆2,213 Updated 8 months ago
lavague-ai / LaVague
Large Action Model framework to develop AI Web Agents
☆5,807 Updated 2 months ago
Doriandarko / maestro
A framework for Claude Opus to intelligently orchestrate subagents.
☆4,185 Updated 6 months ago
entropy-research / Devon
Devon: An open-source pair programmer
☆3,333 Updated 4 months ago
kingjulio8238 / Memary
The Open Source Memory Layer For Autonomous Agents
☆1,955 Updated 2 months ago
Codium-ai / AlphaCodium
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"
☆3,720 Updated last month
SWE-agent / SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensiv…
☆14,188 Updated this week
OS-Copilot / OS-Copilot
A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.
☆1,569 Updated 4 months ago
xlang-ai / OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
☆1,528 Updated this week
developersdigest / llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper
☆4,775 Updated 3 months ago
langroid / langroid
Harness LLMs with Multi-Agent Programming
☆2,917 Updated this week
lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆3,442 Updated 5 months ago
andrewnguonly / Lumos
A RAG LLM co-pilot for browsing the web, powered by local LLMs
☆1,452 Updated last month
nilsherzig / LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…
☆5,783 Updated 4 months ago
e2b-dev / code-interpreter
Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app
☆1,399 Updated this week
MervinPraison / PraisonAI
PraisonAI is an AI Agents Framework with Self Reflection. PraisonAI application combines PraisonAI Agents, AutoGen, and CrewAI into a low…
☆3,009 Updated this week
cohere-ai / cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
☆2,908 Updated this week
SciPhi-AI / R2R
The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a RESTful API.
☆4,437 Updated this week
agiresearch / AIOS
AIOS: AI Agent Operating System
☆3,676 Updated last week
SqueezeAILab / LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
☆1,580 Updated 6 months ago
microsoft / aici
AICI: Prompts as (Wasm) Programs
☆1,979 Updated 2 months ago
AgentOps-AI / agentops
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C…
☆2,619 Updated this week
a-real-ai / pywinassistant
The first open source Large Action Model generalist Artificial Narrow Intelligence agentic framework that controls completely human user …
☆1,276 Updated 6 months ago
run-llama / llama_deploy
Deploy your agentic workflows to production
☆1,915 Updated this week
letta-ai / letta
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
☆13,996 Updated this week
Div99 / agent-protocol
Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building age…
☆1,064 Updated last week