AutoCodeRoverSG / auto-code-roverLinks
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with each task costs less than $0.7.
β3,029Updated 6 months ago
Alternatives and similar repositories for auto-code-rover
Users that are interested in auto-code-rover are comparing it to the libraries listed below
Sorting:
- Devon: An open-source pair programmerβ3,457Updated 5 months ago
- Agentlessπ±: an agentless approach to automatically solve software development problemsβ1,956Updated 10 months ago
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""β3,901Updated 11 months ago
- A framework for Claude Opus to intelligently orchestrate subagents.β4,282Updated last year
- β¨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor.β2,399Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to youβ1,407Updated 11 months ago
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.β1,692Updated last year
- SWE-bench: Can Language Models Resolve Real-world Github Issues?β3,796Updated last month
- Large Action Model framework to develop AI Web Agentsβ6,201Updated 9 months ago
- β2,264Updated last year
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI appβ2,076Updated this week
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecβ¦β17,793Updated last week
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instructβ2,054Updated last year
- β2,565Updated 10 months ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serperβ4,926Updated 4 months ago
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environmentsβ2,315Updated this week
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-userβ¦β1,309Updated 9 months ago
- Generate and auto-execute Python scripts in the cliβ1,805Updated 3 months ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophistβ¦β1,688Updated last year
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 β¦β856Updated last year
- The #1 open-source voice interface for desktop, mobile, and ESP32 chips.β5,093Updated last year
- Together Mixture-Of-Agents (MoA) β 65.1% on AlpacaEval with OSS modelsβ2,835Updated 10 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.β3,146Updated this week
- A RAG LLM co-pilot for browsing the web, powered by local LLMsβ1,509Updated 9 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Callingβ1,780Updated last year
- β938Updated last year
- A framework for serving and evaluating LLM routers - save LLM costs without compromising qualityβ4,411Updated last year
- Prompt design using JSX.β2,720Updated last month
- A code-first agent framework for seamlessly planning and executing data analytics tasks.β5,991Updated 2 weeks ago
- Vision utilities for web interaction agents πβ1,740Updated 11 months ago