githubnext / llmorpheus
LLM-based mutation testing
☆11Updated 2 months ago
Alternatives and similar repositories for llmorpheus:
Users that are interested in llmorpheus are comparing it to the libraries listed below
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…☆37Updated last week
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- Static Analysis meets Large Language Models☆49Updated 11 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆50Updated last month
- Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing☆26Updated 5 months ago
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆15Updated last year
- Leveraging LLMs for modernization through intelligent chunking, iterative prompting and reflection, and retrieval augmented generation (R…☆30Updated 2 weeks ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆11Updated 2 weeks ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆10Updated 11 months ago
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆122Updated 10 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 9 months ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆45Updated 3 months ago
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆12Updated 3 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆35Updated last week
- Two Automatic code completion IDE extensions for @JetBrains and @microsoft/vscode based on Transformer-based large language models for so…☆55Updated last year
- Small, simple agent task environments for training and evaluation☆18Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- Fast and robust AST parsing of any language☆38Updated 3 months ago
- ☆15Updated last month
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆30Updated 8 months ago
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆62Updated 7 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- Incremental Python parser for constrained generation of code by LLMs.☆16Updated 7 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- ☆78Updated last week
- EvoEval: Evolving Coding Benchmarks via LLM☆68Updated last year
- iauto is a low-code engine for building and deploying AI agents☆86Updated 5 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆107Updated 5 months ago
- Dianshu-Liao / AAA-Code-Generation-Framework-for-Code-Repository-Local-Aware-Global-Aware-Third-Party-Aware☆19Updated last year
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆68Updated 7 months ago