Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments
☆13Aug 21, 2023Updated 2 years ago
Alternatives and similar repositories for llm-security-challenge
Users that are interested in llm-security-challenge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Whispers in the Machine: Confidentiality in Agentic Systems☆45Apr 20, 2026Updated last month
- Pin files for contextual, codebase-level AI assistance.☆16Jul 11, 2024Updated last year
- Repo for the paper on Escalation Risks of AI systems☆44Apr 12, 2024Updated 2 years ago
- This Repo focuses on defending against 'adversarial prompts,' detecting and attempting to mitigate objectionable content in real time.☆13Jul 30, 2023Updated 2 years ago
- Risks and targets for assessing LLMs & LLM vulnerabilities☆35May 27, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Professional Wargaming LLM Toolbox☆28Jul 9, 2025Updated 11 months ago
- Improving transparency of large language models' reasoning☆15Nov 25, 2025Updated 6 months ago
- ☆14Mar 31, 2024Updated 2 years ago
- Multiplayer JS game platform☆16Oct 16, 2017Updated 8 years ago
- Example fNIRS BIDS dataset☆15Nov 4, 2022Updated 3 years ago
- 🔥 A repository for collecting cyberdefense thoughts, books, and documents about AI cyberdefense☆13Jul 2, 2023Updated 2 years ago
- 📚📚📚📚📚📚📚📚📚 Reading everything☆16Mar 11, 2026Updated 3 months ago
- Sample Excel add-in and Python script code to run an agent using LLM from an Excel function☆20Jul 16, 2024Updated last year
- Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.☆25Jan 26, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆22Jul 18, 2024Updated last year
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- Tool, paper, and study data for DeepManeuver: Adversarial Test Generation for Trajectory Manipulation of Autonomous Vehicles.☆12Aug 26, 2023Updated 2 years ago
- Website for PauseAI.info☆28Jun 12, 2026Updated last week
- This project investigates the security of large language models by performing binary classification of a set of input prompts to discover…☆63Dec 18, 2023Updated 2 years ago
- LLM security and privacy☆54Oct 15, 2024Updated last year
- MCP server for Oura API integration☆37Feb 27, 2025Updated last year
- The Happy Faces Benchmark☆15Jul 20, 2023Updated 2 years ago
- A collection of security papers on top-tier publications☆67Jun 8, 2026Updated last week
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Rearrrange data by a set of methods☆23Mar 6, 2025Updated last year
- LLMs for Wargames☆23Sep 21, 2024Updated last year
- The burp extension to forward the request☆10Oct 21, 2024Updated last year
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.☆91May 19, 2024Updated 2 years ago
- ☆36May 13, 2026Updated last month
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- github信息泄露搜集工具。GSIL升级版,去除发邮件方式,将结果保存在本地☆13Mar 20, 2021Updated 5 years ago
- ☆41Updated this week
- Agent installed on node to launch IDA,Bindiff,... and send results to the server ( AutoDiffWeb )☆10Mar 25, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11May 21, 2019Updated 7 years ago
- Sentida☆22Dec 14, 2021Updated 4 years ago
- [TMLR 2024] On the Adversarial Robustness of Camera-based 3D Object Detection☆31Apr 23, 2024Updated 2 years ago
- Repository with research related to Android☆13Jul 17, 2018Updated 7 years ago
- Python library for writing Compute Modules☆14Jun 11, 2026Updated last week
- Situational Awareness Dataset☆51Dec 14, 2024Updated last year
- ☆125Jun 10, 2026Updated last week