Aider-AI / refactor-benchmark
Aider's refactoring benchmark exercises based on popular python repos
☆44Updated last month
Related projects ⓘ
Alternatives and complementary repositories for refactor-benchmark
- Chat Markup Language conversation library☆54Updated 10 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆52Updated 4 months ago
- Convert a web page to markdown☆53Updated 2 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- Just a bunch of benchmark logs for different LLMs☆114Updated 3 months ago
- ☆48Updated last year
- ☆72Updated last year
- Simple Graph Memory for AI applications☆79Updated 3 months ago
- ☆40Updated 6 months ago
- Simple examples using Argilla tools to build AI☆38Updated last week
- Natural Language Interfaces Powered by LLMs☆91Updated 3 months ago
- Small, simple agent task environments for training and evaluation☆16Updated last week
- A framework for evaluating function calls made by LLMs☆34Updated 3 months ago
- Writing Blog Posts with Generative Feedback Loops!☆42Updated 7 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆47Updated last month
- 🤖 Headless IDE for AI agents☆129Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆62Updated this week
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆56Updated 3 months ago
- Demo of ConversationEntityMemory in Streamlit.☆51Updated last year
- A framework for orchestrating AI agents using a mermaid graph☆74Updated 5 months ago
- ☆22Updated 4 months ago
- Routing on Random Forest (RoRF)☆83Updated last month
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆110Updated 3 weeks ago
- ☆75Updated 9 months ago
- ☆30Updated 4 months ago
- Helper functions to generate JSON schema dicts for OpenAI ChatGPT function calling requests.☆66Updated this week
- A new benchmark for measuring LLM's capability to detect bugs in large codebase.☆27Updated 5 months ago
- ☆31Updated 2 weeks ago
- ☆104Updated 7 months ago
- 🐣🕐📅 A simple utility to draft scheduling emails.☆12Updated last year