AI-secure/RedCode

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AI-secure/RedCode)

AI-secure / RedCode

[NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents

☆85

Alternatives and similar repositories for RedCode

Users that are interested in RedCode are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AI-secure / MMDT
View on GitHub
Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models
☆29Mar 15, 2025Updated last year
Lyz1213 / Backdoored_PPLM
View on GitHub
☆15Dec 12, 2023Updated 2 years ago
AI-secure / AgentPoison
View on GitHub
[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"
☆230Jun 17, 2026Updated last month
wssun / BADCODE
View on GitHub
Backdooring Neural Code Search
☆14Sep 8, 2023Updated 2 years ago
purpcode-uiuc / purpcode
View on GitHub
🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025
☆40Aug 24, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
PurduePAML / PICCOLO
View on GitHub
☆26Dec 1, 2022Updated 3 years ago
microsoft / CodeGenerationPoisoning
View on GitHub
Proof of concept code for poisoning code generation models.
☆59Dec 6, 2023Updated 2 years ago
agiresearch / ASB
View on GitHub
Agent Security Bench (ASB)
☆271Apr 16, 2026Updated 3 months ago
facebookresearch / rl-injector
View on GitHub
Official release of code for the paper RL is a hammer and LLMs are nails A simple RL approach to stronger prompt injection attacks
☆53May 6, 2026Updated 2 months ago
datasec-lab / CodeBreaker
View on GitHub
[USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai…
☆60Mar 22, 2025Updated last year
PurduePAML / DBS
View on GitHub
☆18Aug 15, 2022Updated 3 years ago
ethz-spylab / agentdojo
View on GitHub
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆670Jun 2, 2026Updated last month
eth-sri / sven
View on GitHub
☆131Jul 14, 2024Updated 2 years ago
chichidd / llm-lora-trojan
View on GitHub
Code for paper "The Philosopher’s Stone: Trojaning Plugins of Large Language Models"
☆33Sep 11, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ErxinYu / CoSafe-Dataset
View on GitHub
☆13Nov 12, 2024Updated last year
reza321 / T-Miner
View on GitHub
☆19Mar 9, 2024Updated 2 years ago
AI-secure / PolyGuard
View on GitHub
☆23Jun 18, 2025Updated last year
uiuc-kang-lab / AdaptiveAttackAgent
View on GitHub
☆38Mar 12, 2025Updated last year
MiracleHH / CBA
View on GitHub
Composite Backdoor Attacks Against Large Language Models
☆25Apr 12, 2024Updated 2 years ago
wssun / KillBadCode
View on GitHub
Show Me Your Code! Kill Code Poisoning: A Lightweight Method Based on Code Naturalness
☆19Jul 17, 2025Updated last year
zhenxianglance / RE-paper
View on GitHub
Reverse Engineering Imperceptible Backdoor Attacks on Deep Neural Networks for Detection and Training Set Cleansing
☆15Feb 18, 2021Updated 5 years ago
stanford-crfm / air-bench-2024
View on GitHub
AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies
☆30Aug 14, 2024Updated last year
wearetyomsmnv / Awesome-LLM-agent-Security
View on GitHub
All about llm-agents security,attack,vulnerabilities and how to do them for cybersecurity.
☆54Jul 11, 2026Updated last week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Gwinhen / MOTH
View on GitHub
This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur…
☆11Aug 24, 2022Updated 3 years ago
uw-nsl / safechain
View on GitHub
[ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
☆30Apr 2, 2025Updated last year
guardagent / code
View on GitHub
☆47Dec 9, 2025Updated 7 months ago
AI-secure / UDora
View on GitHub
[ICML 2025] UDora: A Unified Red Teaming Framework against LLM Agents
☆37Jun 24, 2025Updated last year
ucsb-mlsec / SeCodePLT
View on GitHub
☆15Sep 24, 2025Updated 9 months ago
jinyuan-jia / BadEncoder
View on GitHub
☆84Aug 3, 2021Updated 4 years ago
OSU-NLP-Group / EIA_against_webagent
View on GitHub
☆40Oct 2, 2024Updated last year
PurCL / ProSec
View on GitHub
Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"
☆18Feb 26, 2026Updated 4 months ago
compsec-snu / pfi
View on GitHub
PFI: Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents
☆30Mar 26, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
SchwinnL / circuit-breakers-eval
View on GitHub
Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting
☆18Apr 15, 2025Updated last year
EchoSafe-MLLM / EchoSafe
View on GitHub
[CVPR 2026] Code for Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
☆15Mar 18, 2026Updated 4 months ago
ryoungj / ToolEmu
View on GitHub
[ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use
☆212Mar 22, 2024Updated 2 years ago
kaijiezhu11 / MELON
View on GitHub
[ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
☆36Jul 31, 2025Updated 11 months ago
haoyuwang99 / AgentSpec
View on GitHub
☆45Jan 15, 2026Updated 6 months ago
clearloveclearlove / BEAT
View on GitHub
☆15Feb 26, 2025Updated last year
AI-secure / DecodingTrust
View on GitHub
A Comprehensive Assessment of Trustworthiness in GPT Models
☆314Sep 16, 2024Updated last year