Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments
☆13Aug 21, 2023Updated 2 years ago
Alternatives and similar repositories for llm-security-challenge
Users that are interested in llm-security-challenge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Whispers in the Machine: Confidentiality in Agentic Systems☆43Dec 11, 2025Updated 3 months ago
- This project aims at giving the best customer service ever using the power of LLM models like GPT.☆10Jun 29, 2023Updated 2 years ago
- AI-Powered CyberSecurity Compliance: Boost Network Security with OpenAI GPT-3.5-turbo☆10May 18, 2023Updated 2 years ago
- Pin files for contextual, codebase-level AI assistance.☆16Jul 11, 2024Updated last year
- Repo for the paper on Escalation Risks of AI systems☆44Apr 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This Repo focuses on defending against 'adversarial prompts,' detecting and attempting to mitigate objectionable content in real time.☆14Jul 30, 2023Updated 2 years ago
- Risks and targets for assessing LLMs & LLM vulnerabilities☆34May 27, 2024Updated last year
- Professional Wargaming LLM Toolbox☆21Jul 9, 2025Updated 8 months ago
- ☆11Sep 7, 2023Updated 2 years ago
- ☆16Dec 30, 2023Updated 2 years ago
- G2Net Competition☆12Aug 2, 2023Updated 2 years ago
- Improving transparency of large language models' reasoning☆15Nov 25, 2025Updated 4 months ago
- Decentralized File storage system☆15Oct 29, 2023Updated 2 years ago
- ☆14Mar 31, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆17Aug 8, 2023Updated 2 years ago
- Multiplayer JS game platform☆16Oct 16, 2017Updated 8 years ago
- Example fNIRS BIDS dataset☆14Nov 4, 2022Updated 3 years ago
- ☆20Jun 4, 2023Updated 2 years ago
- 🔥 A repository for collecting cyberdefense thoughts, books, and documents about AI cyberdefense☆13Jul 2, 2023Updated 2 years ago
- 📚📚📚📚📚📚📚📚📚 Reading everything☆15Mar 11, 2026Updated 2 weeks ago
- 🧠 Inspecting complexity and goal-directedness of imagination in an fNIRS BCI system.☆11Aug 26, 2023Updated 2 years ago
- Sample Excel add-in and Python script code to run an agent using LLM from an Excel function☆19Jul 16, 2024Updated last year
- Methods 2: The General Linear Model☆15May 5, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- New York Times Article Summarization Tool☆17Sep 15, 2019Updated 6 years ago
- Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.☆25Jan 26, 2024Updated 2 years ago
- ☆22Jul 18, 2024Updated last year
- ☆12Dec 22, 2023Updated 2 years ago
- ☆15May 10, 2023Updated 2 years ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- Tool, paper, and study data for DeepManeuver: Adversarial Test Generation for Trajectory Manipulation of Autonomous Vehicles.☆11Aug 26, 2023Updated 2 years ago
- The following is a simple example of how LLMs and langchain agents can simplify asking questions to understand the security posture of a …☆23Aug 23, 2023Updated 2 years ago
- Website for PauseAI.info☆24Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- HackMKV - Generate Malacious Video.MKV To Remotely Hack Computer Via Video Files.☆13Jun 11, 2024Updated last year
- Fine-tuning of transformers for Sentiment Analysis☆19May 25, 2021Updated 4 years ago
- A web service in PHP that "translates" HackNPlan webhook messages to Discord webhook messages.☆16Feb 23, 2023Updated 3 years ago
- LLMs for Wargames☆19Sep 21, 2024Updated last year
- This project investigates the security of large language models by performing binary classification of a set of input prompts to discover…☆60Dec 18, 2023Updated 2 years ago
- ☆33Updated this week
- LLM security and privacy☆54Oct 15, 2024Updated last year