[NeurIPS'24] RedCode: Risky Code Execution and Generation Benchmark for Code Agents
☆66 Nov 14, 2025 Updated 3 months ago
Alternatives and similar repositories for RedCode
Users who are interested in RedCode are comparing it to the repositories listed below
- Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models ☆27 Mar 15, 2025 Updated 11 months ago
- [NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning" ☆199 Apr 12, 2025 Updated 10 months ago
- Backdooring Neural Code Search ☆14 Sep 8, 2023 Updated 2 years ago
- ☆18 Aug 15, 2022 Updated 3 years ago
- 🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025 ☆35 Aug 24, 2025 Updated 6 months ago
- ☆19 Mar 9, 2024 Updated last year
- ☆26 Dec 1, 2022 Updated 3 years ago
- Reverse Engineering Imperceptible Backdoor Attacks on Deep Neural Networks for Detection and Training Set Cleansing ☆14 Feb 18, 2021 Updated 5 years ago
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur… ☆11 Aug 24, 2022 Updated 3 years ago
- Agent Security Bench (ASB) ☆186 Oct 27, 2025 Updated 4 months ago
- Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning ☆22 Jul 8, 2024 Updated last year
- ☆37 Oct 2, 2024 Updated last year
- Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking" ☆18 Mar 10, 2025 Updated 11 months ago
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment" ☆17 Updated this week
- Distribution Preserving Backdoor Attack in Self-supervised Learning ☆20 Jan 27, 2024 Updated 2 years ago
- ☆33 Dec 9, 2025 Updated 2 months ago
- [ICML 2025] UDora: A Unified Red Teaming Framework against LLM Agents ☆32 Jun 24, 2025 Updated 8 months ago
- ☆15 Dec 29, 2023 Updated 2 years ago
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT ☆36 Oct 15, 2023 Updated 2 years ago
- AIR-Bench 2024 is a safety benchmark that aligns with emerging government regulations and company policies ☆28 Aug 14, 2024 Updated last year
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting ☆18 Apr 15, 2025 Updated 10 months ago
- [IEEE S&P'24] ODSCAN: Backdoor Scanning for Object Detection Models ☆20 Oct 5, 2025 Updated 4 months ago
- The collection of papers about Private Evolution ☆18 Oct 7, 2025 Updated 4 months ago
- The official repository of the paper "The Digital Cybersecurity Expert: How Far Have We Come?" presented in IEEE S&P 2025 ☆24 May 21, 2025 Updated 9 months ago
- ☆20 Feb 11, 2024 Updated 2 years ago
- [NDSS'23] BEAGLE: Forensics of Deep Learning Backdoor Attack for Better Defense ☆17 May 7, 2024 Updated last year
- Composite Backdoor Attacks Against Large Language Models ☆22 Apr 12, 2024 Updated last year
- ☆126 Jul 14, 2024 Updated last year
- Code for generating adversarial color-shifted images ☆19 Nov 11, 2019 Updated 6 years ago
- [NDSS 2025] "CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models" ☆24 Aug 20, 2025 Updated 6 months ago
- A curated list of amazing libraries, services and resources to work with PDF files ☆16 Jan 28, 2026 Updated 3 weeks ago
- A toolbox for backdoor attacks. ☆23 Jan 13, 2023 Updated 3 years ago
- ☆16 Dec 3, 2024 Updated last year
- Code for paper "The Philosopher’s Stone: Trojaning Plugins of Large Language Models" ☆27 Sep 11, 2024 Updated last year
- [USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai… ☆56 Mar 22, 2025 Updated 11 months ago
- ☆23 Oct 11, 2024 Updated last year
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred… ☆104 Aug 13, 2024 Updated last year
- Official repository for "Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks" ☆61 Aug 8, 2024 Updated last year
- Papers and resources related to the security and privacy of LLMs 🤖 ☆566 Jun 8, 2025 Updated 8 months ago