codestoryai / swe_bench_traces
Contains the model patches and the eval logs from the passing swe-bench-lite run.
☆10Updated 6 months ago
Alternatives and similar repositories for swe_bench_traces:
Users who are interested in swe_bench_traces are comparing it to the libraries listed below
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated 2 months ago
- The Swarm Ecosystem☆18Updated 5 months ago
- watch your screen while doing sales and fill your crm automatically☆14Updated 7 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆13Updated this week
- Nexusflow function call, tool use, and agent benchmarks.☆18Updated last month
- ☆30Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track, catalogue, annotate, and plan for you…☆34Updated 8 months ago
- ☆15Updated 9 months ago
- A QT GUI for large language models☆26Updated last year
- A Model Context Protocol (MCP) server that helps read GitHub repository structure and important files.☆20Updated 3 weeks ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated 11 months ago
- ☆11Updated 8 months ago
- A collection of pre-built wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and others!☆28Updated last month
- An autonomous orchestrator that unites and manages open-source devs for complex problems by facilitating synergy between multiple Discord s…☆12Updated 4 months ago
- 🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks.☆23Updated last year
- An all-new OS that orchestrates autonomous agents as workers to execute tasks.☆17Updated 2 months ago
- ☆24Updated 4 months ago
- ☆19Updated 10 months ago
- FalkorDB-Browser is a visualization UI for FalkorDB.☆23Updated this week
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen.☆25Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆30Updated 11 months ago
- Code Interpreter Replica☆20Updated last year
- The official Python library for Formulaic☆15Updated 8 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆32Updated 3 weeks ago
- time based thinking and structure like OpenAI's o1 preview.☆11Updated 4 months ago
- Agent computer interface for AI software engineer.☆22Updated this week
- ☆27Updated 7 months ago
- 🧬 [WIP] Lobe Flow - an open-source ai powered node flow editor☆22Updated last year
- Web Interface for Vision Language Models Including InternVLM2☆16Updated 5 months ago
- ☆11Updated 4 months ago