githubnext / llmorpheus
LLM-based mutation testing
☆11Updated 3 months ago
Alternatives and similar repositories for llmorpheus
Users that are interested in llmorpheus are comparing it to the libraries listed below
Sorting:
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 10 months ago
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…☆38Updated last month
- Fast and robust AST parsing of any language☆39Updated 4 months ago
- Static Analysis meets Large Language Models☆50Updated last year
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆45Updated 4 months ago
- Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…☆10Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆11Updated last month
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, …☆29Updated 9 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆35Updated last week
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆12Updated last week
- Harness used to benchmark aider against SWE Bench benchmarks☆71Updated 10 months ago
- WAFFLE: Multi-Modal Model for Automated Front-End Development - by Shanchao Liang and Nan Jiang and Shangshu Qian and Lin Tan☆10Updated 4 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- repo for the paper titled “CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation”☆15Updated last year
- Incremental Python parser for constrained generation of code by LLMs.☆16Updated 7 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆68Updated 8 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 6 months ago
- Setup an MCP server in 60 seconds.☆12Updated 5 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 4 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆30Updated 9 months ago
- RepairAgent is an autonomous LLM-based agent for software repair.☆41Updated last month
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…☆55Updated 2 months ago
- A Comparative Study of Various Code Embeddings in Software Semantic Matching☆16Updated 2 years ago
- ☆30Updated 2 months ago
- ☆41Updated 5 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆50Updated 7 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 6 months ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆26Updated 3 weeks ago
- ☆15Updated 4 months ago