codestoryai / swe_bench_traces
Contains the model patches and the eval logs from the passing swe-bench-lite run.
☆10Updated 4 months ago
Related projects
Alternatives and complementary repositories for swe_bench_traces
- LangChain + LiteLLM that works☆24Updated last week
- ☆30Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- The official Python library for Formulaic☆14Updated 6 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated 9 months ago
- ☆14Updated 7 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆11Updated last week
- A QT GUI for large language models☆24Updated 10 months ago
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen.☆25Updated 11 months ago
- A collection of pre-built wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and others!☆19Updated last week
- A simple create-llama template using llama-index v0.10 and integrated with Ollama☆9Updated 5 months ago
- ☆18Updated 8 months ago
- Structured outputs from DSPy and Jinja2☆14Updated last week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆13Updated 8 months ago
- Apps that run on modal.com☆12Updated 5 months ago
- TaskWeaver Plugins☆12Updated 9 months ago
- Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem.☆20Updated 5 months ago
- ☆11Updated 2 months ago
- ☆26Updated 6 months ago
- Never forget anything again! Combine AI and intelligent tooling for a local knowledge base to track, catalogue, annotate, and plan for you…☆32Updated 5 months ago
- AI_Powered_Dev_Search_Engine☆12Updated 8 months ago
- 🌟 EasyAGI: A generalist agent that can go online and accomplish complex tasks.☆23Updated 11 months ago
- A framework for hosting and scaling AI agents.☆18Updated this week
- Github repo for storing LlamaDatasets☆29Updated 9 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- A function to do all☆35Updated 6 months ago
- Luann allows you to create an LLM agent, which has a complete memory module (long-term memory, short-term memory) and knowledge module (Variou…☆16Updated this week
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆18Updated 3 weeks ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆27Updated last week
- An LLM playground similar to the OpenAI API playground☆17Updated 10 months ago