AutoCodeRoverSG / auto-code-rover
A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% of tasks (pass@1) in SWE-bench lite and 46.2% of tasks (pass@1) in SWE-bench verified, with each task costing less than $0.7.
☆2,810Updated 3 weeks ago
Alternatives and similar repositories for auto-code-rover:
Users who are interested in auto-code-rover are comparing it to the libraries listed below:
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"☆3,723Updated 2 months ago
- [ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?☆2,341Updated last week
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,361Updated last month
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,388Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆2,922Updated this week
- ✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.☆2,229Updated 9 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,501Updated 5 months ago
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C…☆2,719Updated this week
- structured outputs for llms☆9,140Updated this week
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,565Updated last week
- SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensiv…☆14,297Updated this week
- Large Action Model framework to develop AI Web Agents☆5,839Updated last week
- A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,583Updated 4 months ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,794Updated 4 months ago
- Harness LLMs with Multi-Agent Programming☆2,965Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,546Updated last week
- AICI: Prompts as (Wasm) Programs☆1,995Updated last week
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,196Updated 6 months ago
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆829Updated 6 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆1,993Updated 2 months ago
- Zep | The Memory Foundation For Your AI Stack☆2,923Updated 2 months ago
- The Open Source Memory Layer For Autonomous Agents☆1,971Updated 3 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,200Updated 4 months ago
- [ICLR 2025] Automated Design of Agentic Systems☆1,148Updated this week
- Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api☆997Updated this week
- AutoGroq is a groundbreaking tool that revolutionizes the way users interact with Autogen™ and other AI assistants. By dynamically genera…☆1,402Updated last month
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,460Updated this week
- Devon: An open-source pair programmer☆3,354Updated 5 months ago
- No-code multi-agent framework to build LLM Agents, workflows and applications with your data☆1,750Updated last month
- Deploy your agentic workflows to production☆1,928Updated this week