codestoryai / swe_bench_tracesLinks
Contains the model patches and the eval logs from the passing swe-bench-lite run.
☆10Updated last year
Alternatives and similar repositories for swe_bench_traces
Users that are interested in swe_bench_traces are comparing it to the libraries listed below
Sorting:
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated 3 weeks ago
- Setup an MCP server in 60 seconds.☆13Updated last year
- ☆11Updated last year
- CLI that uses DSPy to interact with MCP servers.☆23Updated 9 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆17Updated 2 months ago
- Task management for AI agents☆15Updated 6 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- Code Interpreter Replica☆26Updated 2 years ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆16Updated this week
- ☆11Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year
- watch your screen while doing sales and fill your crm automatically☆17Updated last year
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆31Updated 6 months ago
- MindMapper is an innovative program that empowers intelligent agents to navigate complex thought landscapes and collaboratively map their…☆31Updated last month
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆48Updated 2 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Updated last year
- ☆20Updated last year
- The Swarm Ecosystem☆26Updated last year
- a suite of finetuned LLMs for atomically precise function calling 🧪☆17Updated 2 weeks ago
- Learn how to use logit bias with OpenAI models to create highly-powerful classifiers in minutes.☆34Updated 2 years ago
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Updated last year
- A forest of autonomous agents.☆19Updated 10 months ago
- Ready-to-use agent that can interact directly with any tool or native endpoint, in less than 5 lines of code☆42Updated 2 months ago
- ☆33Updated 2 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Updated 9 months ago
- examples and guides to using Nomic Atlas☆37Updated 8 months ago
- A minimal Model Context Protocol 🖥️ server/client🧑💻with Azure OpenAI and 🌐 web browser control via Playwright.☆31Updated 8 months ago
- A collection of pre-build wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and othersz!☆41Updated last month
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆43Updated last week