Aider-AI / refactor-benchmarkLinks

Aider's refactoring benchmark exercises based on popular python repos

☆77

Alternatives and similar repositories for refactor-benchmark

Users that are interested in refactor-benchmark are comparing it to the libraries listed below

Sorting:

Aider-AI / aider-swe-bench
Harness used to benchmark aider against SWE Bench benchmarks
☆72Updated last year
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆87Updated 10 months ago
Aider-AI / polyglot-benchmark
Coding problems used in aider's polyglot benchmark
☆167Updated 7 months ago
llllvvuu / instant_apply
proof-of-concept of Cursor's Instant Apply feature
☆83Updated 11 months ago
Technoculture / personal-graph
Simple Graph Memory for AI applications
☆89Updated 2 months ago
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 6 months ago
All-Hands-AI / openhands-aci
Agent computer interface for AI software engineer.
☆97Updated this week
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87Updated last month
rgbkrk / chatlab
⚡️🧪 Fast LLM Tool Calling Experimentation, big and smol
☆148Updated 10 months ago
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated last year
zhudotexe / redel
ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)
☆83Updated 4 months ago
NL2Code / CodeR
☆159Updated 11 months ago
AnswerDotAI / web2md
Convert a web page to markdown
☆77Updated 11 months ago
GoodAI / goodai-ltm-benchmark
A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…
☆76Updated 7 months ago
yoheinakajima / captainfunction
A Python package to dynamically load functions for OpenAI Assistant
☆54Updated last year
emrgnt-cmplxty / zero-shot-replication
☆74Updated last year
neoxelox / dspy-inspector
DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.
☆37Updated last year
miralab-ai / autoreason
☆40Updated 7 months ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
uukuguy / speechless
LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.
☆104Updated last week
invariantlabs-ai / explorer
A better way of testing, inspecting, and analyzing AI Agent traces.
☆39Updated last month
jmanhype / dspy-self-discover-framework
Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…
☆67Updated last year
gradion-ai / freeact
An AI agent library using Python as the common language to define executable actions and tool interfaces.
☆83Updated last week
nicholasyager / llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python
☆35Updated last year
All-Hands-AI / openhands-resolver
A system that tries to resolve all issues on a github repo with OpenHands.
☆110Updated 8 months ago
codestoryai / prompts
Contains the prompts we use to talk to various LLMs for different utilities inside the editor
☆80Updated last year
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 8 months ago
simonw / llm-cluster
LLM plugin for clustering embeddings
☆80Updated last year
ComposioHQ / Composio-Function-Calling-Benchmark
Function Calling Benchmark & Testing
☆88Updated last year
irthomasthomas / undecidability
☆22Updated last year