nus-apr / auto-code-rover
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 30.67% tasks (pass@1) in SWE-bench lite and 38.40% tasks (pass@1) in SWE-bench verified with each task costs less than $0.7.
☆2,711Updated last week
Related projects ⓘ
Alternatives and complementary repositories for auto-code-rover
- ✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.☆2,141Updated 6 months ago
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,160Updated 4 months ago
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI…☆2,010Updated this week
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,505Updated 2 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!☆3,215Updated 2 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,326Updated this week
- [ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?☆1,943Updated last week
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,620Updated last week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,642Updated last month
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,349Updated 3 months ago
- Devon: An open-source pair programmer☆3,250Updated 2 months ago
- PraisonAI application combines AutoGen and CrewAI or similar frameworks into a low-code solution for building and managing multi-agent LL…☆2,260Updated this week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,593Updated 3 weeks ago
- Harness LLMs with Multi-Agent Programming☆2,611Updated this week
- The first open source Large Action Model generalist Artificial Narrow Intelligence agentic framework that controls completely human user …☆1,263Updated 4 months ago
- Prompt design using JSX.☆2,010Updated 3 weeks ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆710Updated last week
- The easiest way to use Agentic RAG in any enterprise☆3,814Updated this week
- A collection of examples that show how to use CrewAI framework to automate workflows.☆2,876Updated this week
- Large Action Model framework to develop AI Web Agents☆5,448Updated 3 weeks ago
- AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically genera…☆1,336Updated 3 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆2,822Updated this week
- Build resilient language agents as graphs.☆6,531Updated this week
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app☆1,247Updated last week
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,394Updated last month
- The all-in-one solution for RAG. Build, scale, and deploy state of the art Retrieval-Augmented Generation applications☆3,553Updated this week
- Deploy your agentic worfklows to production☆1,819Updated this week
- Mentat - The AI Coding Assistant☆2,565Updated 5 months ago
- AIOS: LLM Agent Operating System☆3,390Updated this week
- Parse files for optimal RAG☆3,033Updated this week