Co1lin / CWEval
Simultaneous evaluation on both functionality and security of LLM-generated code.
☆31Updated 2 months ago
Alternatives and similar repositories for CWEval
Users that are interested in CWEval are comparing it to the libraries listed below
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"☆17Updated 10 months ago
- ☆51Updated last year
- ☆86Updated 5 months ago
- ☆21Updated last year
- Backdooring Neural Code Search☆14Updated 2 years ago
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models☆230Updated 2 weeks ago
- Code for paper "The Philosopher’s Stone: Trojaning Plugins of Large Language Models"☆27Updated last year
- ☆127Updated last year
- ☆13Updated last year
- [TOSEM 2023] A Survey of Learning-based Automated Program Repair☆75Updated last year
- Benchmarking Large Language Models' Resistance to Malicious Code☆14Updated last year
- [2023 TDSC] Pre-trained Model-based Automated Software Vulnerability Repair: How Far are We?☆25Updated 2 years ago
- [NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents☆65Updated 2 months ago
- Repo-Level Code generation papers☆232Updated last month
- Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025]☆55Updated 2 weeks ago
- [USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai…☆56Updated 10 months ago
- ☆37Updated last year
- ☆18Updated 8 months ago
- ☆121Updated last year
- 🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025☆35Updated 5 months ago
- [NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"☆197Updated 9 months ago
- Adversarial Attack for Pre-trained Code Models☆10Updated 3 years ago
- Official Implementation of NeurIPS 2024 paper - BiScope: AI-generated Text Detection by Checking Memorization of Preceding Tokens☆28Updated 3 weeks ago
- Official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries☆63Updated 3 months ago
- Implementation for "RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content"☆22Updated last year
- Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models"☆35Updated 2 years ago
- A Systematic Literature Review on Large Language Models for Automated Program Repair☆228Updated 3 weeks ago
- Replication Package for "Natural Attack for Pre-trained Models of Code", ICSE 2022☆51Updated 3 months ago
- Enhancing Code Pre-trained Models by Contrastive Learning☆38Updated 2 years ago
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".☆85Updated last year