Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments
☆13Aug 21, 2023Updated 2 years ago
Alternatives and similar repositories for llm-security-challenge
Users that are interested in llm-security-challenge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pin files for contextual, codebase-level AI assistance.☆16Jul 11, 2024Updated last year
- Repo for the paper on Escalation Risks of AI systems☆44Apr 12, 2024Updated 2 years ago
- This Repo focuses on defending against 'adversarial prompts,' detecting and attempting to mitigate objectionable content in real time.☆14Jul 30, 2023Updated 2 years ago
- Professional Wargaming LLM Toolbox☆22Jul 9, 2025Updated 9 months ago
- Sample Excel add-in and Python script code to run an agent using LLM from an Excel function☆19Jul 16, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.☆25Jan 26, 2024Updated 2 years ago
- ☆13Dec 22, 2023Updated 2 years ago
- Tool, paper, and study data for DeepManeuver: Adversarial Test Generation for Trajectory Manipulation of Autonomous Vehicles.☆12Aug 26, 2023Updated 2 years ago
- Website for PauseAI.info☆25Updated this week
- Fine-tuning of transformers for Sentiment Analysis☆19May 25, 2021Updated 4 years ago
- A web service in PHP that "translates" HackNPlan webhook messages to Discord webhook messages.☆16Feb 23, 2023Updated 3 years ago
- LLM security and privacy☆53Oct 15, 2024Updated last year
- Code for the paper "Understanding RL Vision"☆51Apr 2, 2023Updated 3 years ago
- The Happy Faces Benchmark☆15Jul 20, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆22Sep 9, 2021Updated 4 years ago
- Deploy smart and secure conversational agents for your employees, using Azure. Able to use both private and public data.☆61Feb 28, 2024Updated 2 years ago
- ☆12May 6, 2022Updated 3 years ago
- A collection of security papers on top-tier publications☆65Updated this week
- ☆36Updated this week
- The burp extension to forward the request☆10Oct 21, 2024Updated last year
- A Novel Benchmark evaluating the Deep Capability of Vulnerability Detection with Large Language Models☆34Apr 25, 2025Updated 11 months ago
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.☆91May 19, 2024Updated last year
- VulnMapAI combines the power of nmap’s detailed network scanning and the advanced natural language processing capabilities of GPT-4 to ge…☆32Oct 18, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆30Updated this week
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- Agent installed on node to launch IDA,Bindiff,... and send results to the server ( AutoDiffWeb )☆10Mar 25, 2016Updated 10 years ago
- ☆11Dec 23, 2024Updated last year
- ☆11May 21, 2019Updated 6 years ago
- [TMLR 2024] On the Adversarial Robustness of Camera-based 3D Object Detection☆31Apr 23, 2024Updated last year
- Repository with research related to Android☆13Jul 17, 2018Updated 7 years ago
- A WordPress plugin that integrates ChatGPT to your website☆13Nov 20, 2023Updated 2 years ago
- Situational Awareness Dataset☆49Dec 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Talking Santa-GPT with Speech Recognition☆15Dec 28, 2023Updated 2 years ago
- ☆119Jan 19, 2026Updated 3 months ago
- Delving into the Realm of LLM Security: An Exploration of Offensive and Defensive Tools, Unveiling Their Present Capabilities.☆169Oct 13, 2023Updated 2 years ago
- ☆14Feb 26, 2025Updated last year
- multi agent team with coding and data analysis capability to structure real estate investment plans and help with decision making.☆17Jun 11, 2024Updated last year
- Source code for the ACL'2025 paper titled "Unveiling privacy risks in llm agent memory"☆29Dec 2, 2025Updated 4 months ago
- Official Code Implementation for the CCS 2022 Paper "On the Privacy Risks of Cell-Based NAS Architectures"☆11Nov 21, 2022Updated 3 years ago