codestoryai / swe_bench_tracesLinks
Contains the model patches and the eval logs from the passing swe-bench-lite run.
☆10Updated last year
Alternatives and similar repositories for swe_bench_traces
Users that are interested in swe_bench_traces are comparing it to the libraries listed below
Sorting:
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- CLI that uses DSPy to interact with MCP servers.☆24Updated 11 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated last week
- ☆12Updated last year
- ☆11Updated last year
- Setup an MCP server in 60 seconds.☆13Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Updated 3 months ago
- CLI for Recursive Language Models☆37Updated last week
- Code Interpreter Replica☆26Updated 2 years ago
- ☆26Updated last year
- Task management for AI agents☆15Updated 7 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- MindMapper is an innovative program that empowers intelligent agents to navigate complex thought landscapes and collaboratively map their…☆32Updated 3 months ago
- A minimal Model Context Protocol 🖥️ server/client🧑💻with Azure OpenAI and 🌐 web browser control via Playwright.☆31Updated 10 months ago
- examples and guides to using Nomic Atlas☆37Updated 9 months ago
- The Swarm Ecosystem☆26Updated last year
- watch your screen while doing sales and fill your crm automatically☆17Updated last year
- A forest of autonomous agents.☆19Updated last year
- ☆14Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆18Updated last month
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆60Updated 11 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆50Updated 3 months ago
- Visualize any repo or codebase into diagram or animation☆20Updated last year
- ☆20Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Updated 2 years ago
- Easiest way to build custom agents, in a no-code notion style editor, using simple macros.☆34Updated last year
- Example for Logging LLM Evaluator Prompt Responses☆18Updated 2 years ago
- Access the Cohere Command R family of models☆38Updated 10 months ago