Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments
☆13Aug 21, 2023Updated 2 years ago
Alternatives and similar repositories for llm-security-challenge
Users that are interested in llm-security-challenge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Whispers in the Machine: Confidentiality in Agentic Systems☆44Apr 20, 2026Updated last month
- Professional Wargaming LLM Toolbox☆28Jul 9, 2025Updated 10 months ago
- ☆11Sep 7, 2023Updated 2 years ago
- ☆14Mar 31, 2024Updated 2 years ago
- Sample Excel add-in and Python script code to run an agent using LLM from an Excel function☆20Jul 16, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- New York Times Article Summarization Tool☆17Sep 15, 2019Updated 6 years ago
- ☆13Dec 22, 2023Updated 2 years ago
- Agent-Friendly Web Principles☆33Oct 15, 2025Updated 7 months ago
- This project investigates the security of large language models by performing binary classification of a set of input prompts to discover…☆63Dec 18, 2023Updated 2 years ago
- LLM security and privacy☆54Oct 15, 2024Updated last year
- ☆22Sep 9, 2021Updated 4 years ago
- ☆12May 6, 2022Updated 4 years ago
- Tools for exploring Transformer neuron behaviour, including input pruning and diversification.☆23Sep 28, 2023Updated 2 years ago
- A collection of security papers on top-tier publications☆67May 18, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Rearrrange data by a set of methods☆23Mar 6, 2025Updated last year
- The burp extension to forward the request☆10Oct 21, 2024Updated last year
- Agent installed on node to launch IDA,Bindiff,... and send results to the server ( AutoDiffWeb )☆10Mar 25, 2016Updated 10 years ago
- ☆11May 21, 2019Updated 7 years ago
- This repository contains various shell scripts and tips and tricks used for packaging androidtamer packages☆14Jul 10, 2022Updated 3 years ago
- [TMLR 2024] On the Adversarial Robustness of Camera-based 3D Object Detection☆31Apr 23, 2024Updated 2 years ago
- Repository with research related to Android☆13Jul 17, 2018Updated 7 years ago
- Python library for writing Compute Modules☆14Feb 17, 2026Updated 3 months ago
- ☆10Jun 29, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Delving into the Realm of LLM Security: An Exploration of Offensive and Defensive Tools, Unveiling Their Present Capabilities.☆169Oct 13, 2023Updated 2 years ago
- ☆15Feb 26, 2025Updated last year
- multi agent team with coding and data analysis capability to structure real estate investment plans and help with decision making.☆18Jun 11, 2024Updated last year
- ☆14Sep 21, 2025Updated 8 months ago
- Source code for the ACL'2025 paper titled "Unveiling privacy risks in llm agent memory"☆30Dec 2, 2025Updated 5 months ago
- A fast and simple WebSocket relay, built in Rust, that enables a peer-to-peer-like network communication.☆15Aug 10, 2024Updated last year
- Official Code Implementation for the CCS 2022 Paper "On the Privacy Risks of Cell-Based NAS Architectures"☆11Nov 21, 2022Updated 3 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- Collection of scanner checks missing in Burp☆15Apr 22, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆32Dec 11, 2024Updated last year
- ☆26Aug 1, 2024Updated last year
- Internal Consistency Regularization (CROW) for LLM Backdoor Elimination - Paper accepted to ICML 2025☆16May 6, 2025Updated last year
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- [Preprint] Backdoor Attacks on Federated Learning with Lottery Ticket Hypothesis☆10Sep 23, 2021Updated 4 years ago
- 👀 All-Seeing Eye: Arbitrary File Read Vulnerability in Chrome Versions Prior to 116☆10Nov 18, 2023Updated 2 years ago
- ☆15Mar 9, 2025Updated last year