This project investigates the security of large language models by performing binary classification of a set of input prompts to discover malicious prompts. Several approaches have been analyzed using classical ML algorithms, a trained LLM model, and a fine-tuned LLM model.
☆63Dec 18, 2023Updated 2 years ago
Alternatives and similar repositories for llm-security-prompt-injection
Users that are interested in llm-security-prompt-injection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the Over…☆13Aug 21, 2023Updated 2 years ago
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adpative Prompt Injection Challenge☆25Apr 9, 2026Updated 2 months ago
- CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context☆19Feb 20, 2026Updated 3 months ago
- The following is a simple example of how LLMs and langchain agents can simplify asking questions to understand the security posture of a …☆23Aug 23, 2023Updated 2 years ago
- Curated UTF-8 URL-encoded character dictionary for injection testing, fuzzing, and bypass techniques against web applications and APIs, f…☆13Sep 20, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AI-Powered CyberSecurity Compliance: Boost Network Security with OpenAI GPT-3.5-turbo☆10May 18, 2023Updated 3 years ago
- Exploit for CVE-2024-3273, supports single and multiple hosts☆13Apr 7, 2024Updated 2 years ago
- AyedFuzzer is a small File-Format-Fuzzer with 3 options (File-mutating, WinDbg-interactive monitor, multi-processing) for windows executa…☆17Dec 2, 2024Updated last year
- This Repo focuses on defending against 'adversarial prompts,' detecting and attempting to mitigate objectionable content in real time.☆13Jul 30, 2023Updated 2 years ago
- SNMP Bash Script to discover valid community strings, dump basic information, check for write permission and check for RCE.☆11Apr 27, 2024Updated 2 years ago
- A self-assessment tool by @NC3-LU to help business owners implement a better cybersecurity strategy.☆25Feb 13, 2026Updated 3 months ago
- [USENIX Security'24] Official repository of "Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise a…☆114Oct 11, 2024Updated last year
- Official repository of the paper: Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code (Findings of EACL …☆12Mar 26, 2026Updated 2 months ago
- Master PDF Summarization with Google Bard☆13Feb 29, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆16Aug 8, 2023Updated 2 years ago
- Whispers in the Machine: Confidentiality in Agentic Systems☆44Apr 20, 2026Updated last month
- ☆20Jun 4, 2023Updated 3 years ago
- Multi-agent AI system using GPT-4o, DeepSeek v3, and Llama 3.3 to detect if CVE vulnerabilities were exploited as zero-days. Analyzes…☆20Feb 13, 2026Updated 3 months ago
- Delving into the Realm of LLM Security: An Exploration of Offensive and Defensive Tools, Unveiling Their Present Capabilities.☆169Oct 13, 2023Updated 2 years ago
- Risks and targets for assessing LLMs & LLM vulnerabilities☆35May 27, 2024Updated 2 years ago
- 1990–2021년 한국어 신문 사회면 기사의 ○○女·○○男 집계☆17Sep 26, 2023Updated 2 years ago
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings)☆15May 14, 2023Updated 3 years ago
- Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.☆25Jan 26, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 언어와 컴퓨터 (2021학년도 2학기, 서울대학교 언어학과)☆13Aug 16, 2022Updated 3 years ago
- ☆15May 10, 2023Updated 3 years ago
- HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units (KDD 2020)☆12Jan 25, 2021Updated 5 years ago
- RuleVis is a powerful analysis tool that transforms your Wazuh ruleset into a dynamic, interactive force-directed graph. It helps you vis…☆26Nov 12, 2025Updated 7 months ago
- Application scanning component of OWASP PurpleTeam☆16Feb 12, 2023Updated 3 years ago
- Training scenarios for cyber ranges☆15Apr 24, 2020Updated 6 years ago
- ☆44Jun 29, 2023Updated 2 years ago
- ☆20Jan 9, 2024Updated 2 years ago
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- AIxCC: automated vulnerability repair via LLMs, search, and static analysis☆13Jul 16, 2024Updated last year
- Analyse Social Network of co-authors in DBLP website (https://dblp.uni-trier.de) using NetworkX.☆13May 27, 2020Updated 6 years ago
- Small web frontend for using openAI's GPT-3.5 and GPT-4's API☆59Jun 1, 2026Updated last week
- LLM Program Watermarking☆18Apr 19, 2024Updated 2 years ago
- AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks☆66Jan 15, 2026Updated 4 months ago
- AWS CIS Controls module for terraform☆11Nov 16, 2023Updated 2 years ago
- OWASP Foundation Web Respository☆391Updated this week