githubnext / llmorpheus
LLM-based mutation testing
☆11 Updated 5 months ago
Alternatives and similar repositories for llmorpheus
Users that are interested in llmorpheus are comparing it to the libraries listed below
- Static Analysis meets Large Language Models ☆50 Updated last year
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation ☆12 Updated 2 months ago
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena… ☆39 Updated 3 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆39 Updated last week
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024 ☆73 Updated 10 months ago
- ☆96 Updated 10 months ago
- Source code for paper: INTERVENOR: Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing ☆26 Updated 7 months ago
- The official Python SDK for Codellm-Devkit ☆106 Updated 2 weeks ago
- Fast and robust AST parsing of any language ☆44 Updated 6 months ago
- Harness used to benchmark aider against SWE Bench benchmarks ☆72 Updated last year
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules" ☆45 Updated 6 months ago
- ☆94 Updated last month
- Leveraging LLMs for modernization through intelligent chunking, iterative prompting and reflection, and retrieval augmented generation (R… ☆34 Updated this week
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 8 months ago
- A tool to build a graph from a codebase ☆132 Updated this week
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆54 Updated 4 months ago
- ☆19 Updated 2 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆13 Updated 3 months ago
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline. ☆46 Updated last week
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository ☆64 Updated 10 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness. ☆26 Updated 11 months ago
- Contains the prompts we use to talk to various LLMs for different utilities inside the editor ☆79 Updated last year
- CodeSage: Code Representation Learning At Scale (ICLR 2024) ☆109 Updated 8 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆191 Updated 3 months ago
- ☆65 Updated last year
- ☆16 Updated 6 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 Updated last year
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆124 Updated last year
- RepoQA: Evaluating Long-Context Code Understanding ☆111 Updated 8 months ago