CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that enables in-depth analysis of the results.
☆42Feb 18, 2026Updated 2 months ago
Alternatives and similar repositories for CodeMind
Users that are interested in CodeMind are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Artifact repository for the paper "Perfect Is the Enemy of Test Oracle", In Proceedings of The 30th ACM Joint European Software Engineeri…☆11May 4, 2023Updated 2 years ago
- ☆18Apr 15, 2024Updated 2 years ago
- ☆11Jul 20, 2021Updated 4 years ago
- ☆11Jul 8, 2024Updated last year
- ☆33Jul 6, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Ungrafting Malicious Code from Piggybacked Android Apps☆14Sep 27, 2016Updated 9 years ago
- [S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models☆19Feb 18, 2025Updated last year
- This repository contains code and data of the paper **On the Limitations of Continual Learning for Malware Classification**, accepted to …☆19Dec 29, 2023Updated 2 years ago
- ☆29Jan 17, 2024Updated 2 years ago
- MODIT: On Multi-Modal Learning of Editing Source Code.☆20Apr 24, 2021Updated 5 years ago
- Cascade Speculative Drafting☆33Apr 2, 2024Updated 2 years ago
- Agent fixing SWE bench issues☆19May 21, 2024Updated last year
- An empirical study on patch correctness☆15Nov 5, 2022Updated 3 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024☆13Apr 21, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆43Jan 1, 2025Updated last year
- search and collect windows files from multiple locations on machine and store in one centralized directory☆20Aug 29, 2012Updated 13 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆49Dec 22, 2023Updated 2 years ago
- Code from the paper: Neurlux: Dynamic Malware Analysis Without Feature Engineering☆13Dec 27, 2020Updated 5 years ago
- DependEval: a hierarchical benchmark for evaluating LLMs on repository-level code understanding across 8 programming languages.☆16Jul 28, 2025Updated 9 months ago
- ☆66Sep 13, 2025Updated 7 months ago
- ☆23Nov 10, 2023Updated 2 years ago
- [NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents☆76Mar 16, 2026Updated last month
- Proof of concept code for poisoning code generation models.☆57Dec 6, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Clover: Closed-Loop Verifiable Code Generation☆46May 12, 2025Updated 11 months ago
- ☆12Jul 8, 2023Updated 2 years ago
- [NAACL 2025] Benchmark for Repository-Level Code Generation, focus on Executability, Correctness from Test Cases and Usage of Contexts fr…☆44Jan 8, 2026Updated 3 months ago
- TeCo: an ML+Execution model for test completion☆31Jun 16, 2024Updated last year
- ☆82Mar 30, 2026Updated last month
- This is the repository for the paper Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descripti…☆25Nov 18, 2022Updated 3 years ago
- AgentRE-Bench is an agentic benchmark that evaluates state-of-the-art models on long-horizon reverse engineering tasks, measuring their a…☆52Updated this week
- mBERT is a mutation testing tool that uses a pre-trained language model (CodeBERT) to generate mutants.☆17Aug 20, 2025Updated 8 months ago
- ☆11Dec 23, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Explanation Optimization☆13Oct 16, 2020Updated 5 years ago
- ☆47Apr 7, 2026Updated 3 weeks ago
- enchmarking Large Language Models' Resistance to Malicious Code☆16Apr 23, 2026Updated last week
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- Condense source code for LLM analysis by extracting essential highlights, utilizing a simplified version of Paul Gauthier's repomap techn…☆14Mar 3, 2024Updated 2 years ago
- For our ICSE21 paper "CURE: Code-Aware Neural Machine Translation for Automatic Program Repair" by Nan Jiang, Thibaud Lutellier, and Lin …☆58Dec 8, 2022Updated 3 years ago
- ☆11Mar 4, 2021Updated 5 years ago