githubnext / llmorpheus
LLM-based mutation testing
☆11Updated 6 months ago
Alternatives and similar repositories for llmorpheus
Users who are interested in llmorpheus are comparing it to the repositories listed below.
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…☆39Updated 4 months ago
- ☆11Updated 9 months ago
- Fast and robust AST parsing of any language☆51Updated 7 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆15Updated 4 months ago
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆13Updated 3 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆54Updated 5 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated last year
- Static Analysis meets Large Language Models☆50Updated last year
- Example implementation of Iteration of Thought - Give a star if you like the project☆43Updated 8 months ago
- Run SWE-bench evaluations remotely☆40Updated 2 weeks ago
- Source code for paper: INTERVENOR: Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing☆26Updated 9 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆74Updated 11 months ago
- ☆98Updated 11 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 9 months ago
- ☆109Updated 2 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 10 months ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆10Updated last year
- ☆64Updated 3 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆40Updated last month
- Transform Claude Code transcript JSONL files into readable terminal and HTML formats.☆38Updated last week
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆127Updated last year
- ☆16Updated 7 months ago
- A Python framework for building AI agent systems with robust task management in the form of a graph execution engine, inference capabilit…☆31Updated 2 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆27Updated 5 months ago
- LLM Optimize is a proof-of-concept library for doing LLM (large language model) guided blackbox optimization.☆58Updated 2 years ago
- Probably one of the lightest native RAG + Agent apps out there, experience the power of Agent-powered models and Agent-driven knowledge ba…☆27Updated 3 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆207Updated 5 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆50Updated 10 months ago
- Never forget anything again! Combine AI and intelligent tooling for a local knowledge base to track, catalogue, annotate, and plan for you…☆39Updated last year
- Harness used to benchmark aider against SWE Bench benchmarks☆71Updated last year