codestoryai / swe_bench_tracesLinks
Contains the model patches and the eval logs from the passing swe-bench-lite run.
☆10Updated last year
Alternatives and similar repositories for swe_bench_traces
Users that are interested in swe_bench_traces are comparing it to the libraries listed below
Sorting:
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- ☆11Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated last week
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆48Updated 2 months ago
- CLI that uses DSPy to interact with MCP servers.☆23Updated 9 months ago
- The Swarm Ecosystem☆26Updated last year
- Task management for AI agents☆15Updated 6 months ago
- Code Interpreter Replica☆26Updated 2 years ago
- Setup an MCP server in 60 seconds.☆13Updated last year
- ☆11Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆16Updated this week
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆17Updated 2 months ago
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Updated last year
- MindMapper is an innovative program that empowers intelligent agents to navigate complex thought landscapes and collaboratively map their…☆31Updated last month
- A collection of pre-build wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and othersz!☆41Updated 2 months ago
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Updated last year
- This repository `II-Commons` contains tools for managing text and image datasets, including loading, fetching, and embedding large datase…☆33Updated 5 months ago
- watch your screen while doing sales and fill your crm automatically☆17Updated last year
- Web Interface for Vision Language Models Including InternVLM2☆25Updated last year
- A daemon that makes a desktop OS accessible to AI agents☆36Updated 6 months ago
- a suite of finetuned LLMs for atomically precise function calling 🧪☆17Updated 3 weeks ago
- A forest of autonomous agents.☆19Updated 11 months ago
- ☆14Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆23Updated 2 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆43Updated this week
- Streamlit Web UI for AGiXT☆28Updated 3 weeks ago