nus-apr / auto-code-rover
A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% of tasks (pass@1) in SWE-bench lite and 46.2% of tasks (pass@1) in SWE-bench verified, with each task costing less than $0.7.
☆2,748Updated this week
Alternatives and similar repositories for auto-code-rover:
Users who are interested in auto-code-rover are comparing it to the libraries listed below.
- ✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.☆2,163Updated 7 months ago
- [ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?☆2,038Updated last week
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,170Updated 5 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,418Updated this week
- [NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employ…☆13,789Updated last week
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,356Updated 4 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!☆3,303Updated 3 months ago
- AIOS: AI Agent Operating System☆3,457Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆2,856Updated this week
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"☆3,662Updated last week
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI…☆2,285Updated this week
- Agentless🐱: an agentless approach to automatically solve software development problems☆749Updated 3 weeks ago
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,607Updated last month
- ☆2,241Updated 8 months ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,690Updated 2 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,354Updated this week
- Devon: An open-source pair programmer☆3,278Updated 3 months ago
- Prompt design using JSX.☆2,123Updated last month
- PraisonAI application combines AutoGen and CrewAI or similar frameworks into a low-code solution for building and managing multi-agent LL…☆2,308Updated 3 weeks ago
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,418Updated last week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,294Updated this week
- The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Join our Community: https://discord.com/servers/agora-999382051…☆1,831Updated this week
- A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,532Updated 2 months ago
- Deploy your agentic workflows to production☆1,857Updated this week
- The easiest way to use Agentic RAG in any enterprise☆3,889Updated 2 weeks ago
- Large Action Model framework to develop AI Web Agents☆5,514Updated 2 weeks ago
- Build resilient language agents as graphs.☆6,953Updated this week
- AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically genera…☆1,349Updated 4 months ago
- Containerized, state of the art Retrieval-Augmented Generation (RAG) system with a RESTful API☆3,770Updated this week
- The #1 open-source voice interface for desktop, mobile, and ESP32 chips.☆4,986Updated last month