codestoryai / swe_bench_tracesLinks
Contains the model patches and the eval logs from the passing swe-bench-lite run.
☆10Updated last year
Alternatives and similar repositories for swe_bench_traces
Users that are interested in swe_bench_traces are comparing it to the libraries listed below
Sorting:
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated last year
- MindMapper is an innovative program that empowers intelligent agents to navigate complex thought landscapes and collaboratively map their…☆29Updated last week
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆38Updated last year
- ☆11Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated 2 weeks ago
- CLI that uses DSPy to interact with MCP servers.☆23Updated 8 months ago
- ☆11Updated last year
- Task management for AI agents☆15Updated 4 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated last month
- Streamlit Web UI for AGiXT☆28Updated 4 months ago
- Code Interpreter Replica☆25Updated 2 years ago
- Setup an MCP server in 60 seconds.☆13Updated 11 months ago
- ☆14Updated last year
- ☆20Updated last year
- ☆15Updated last month
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆15Updated this week
- watch your screen while doing sales and fill your crm automatically☆17Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆51Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆23Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆47Updated last month
- The Swarm Ecosystem☆26Updated last year
- A collection of pre-build wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and othersz!☆40Updated 2 weeks ago
- A minimal Model Context Protocol 🖥️ server/client 🧑💻with Azure OpenAI and 🌐 web browser control via Playwright.☆30Updated 7 months ago
- The world's first fully automated VC fund.☆24Updated 2 weeks ago
- LangChain + LiteLLM that works☆49Updated 2 months ago
- examples and guides to using Nomic Atlas☆38Updated 6 months ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆41Updated last year
- ☆27Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆11Updated last year
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆57Updated 8 months ago