codestoryai / swe_bench_traces
Contains the model patches and the eval logs from the passing swe-bench-lite run.
☆10Updated 7 months ago
Alternatives and similar repositories for swe_bench_traces:
Users that are interested in swe_bench_traces are comparing it to the libraries listed below
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 3 months ago
- The Swarm Ecosystem☆19Updated 6 months ago
- A Model Context Protocol (MCP) server that provides JSON-RPC functionality through OpenRPC.☆18Updated 2 weeks ago
- ☆30Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆13Updated last week
- A collection of pre-build wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and othersz!☆30Updated last week
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 2 months ago
- Agent computer interface for AI software engineer.☆33Updated this week
- A library for generating structured JSON using GPT-4o.☆13Updated 6 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- Training hybrid models for dummies.☆20Updated last month
- watch your screen while doing sales and fill your crm automatically☆15Updated 8 months ago
- A QT GUI for large language models☆30Updated last year
- A Model Context Protocol (MCP) server that provides tools for fetching and analyzing Reddit content.☆16Updated 3 weeks ago
- a simple create-llama template using llama-index v0.10 and integrated with Ollama☆10Updated 9 months ago
- An LLM playground similar to the OpenAI API playground☆21Updated last year
- ☆14Updated last month
- A better way of testing, inspecting, and analyzing AI Agent traces.☆28Updated this week
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆22Updated 9 months ago
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen.☆26Updated last year
- 🧬 [WIP] Lobe Flow - an open-source ai powered node flow editor☆22Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆31Updated last year
- An open-source integration of GraphRAG for Agentic System with NoCode☆12Updated last month
- LLMs as Collaboratively Edited Knowledge Bases☆44Updated 11 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 5 months ago
- An Infr app that automates data collection from your PC, macOS or Linux client.☆11Updated last year