gaborvecsei / Neural-Network-Steganography
Hide some secret data in a Neural Network - text, malicious software or watermark your NN
☆41 · Updated 2 years ago
Related projects:
- ☆27 · Updated this week
- A repository of Language Model Vulnerabilities and Exposures (LVEs). ☆103 · Updated 6 months ago
- ☆58 · Updated 2 months ago
- A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs). ☆116 · Updated 8 months ago
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024. ☆100 · Updated 3 months ago
- ☆13 · Updated this week
- ☆34 · Updated this week
- Whispers in the Machine: Confidentiality in LLM-integrated Systems ☆28 · Updated last week
- Red-Teaming Language Models with DSPy ☆116 · Updated 5 months ago
- Demonstrates iterative FGSM on Apple's NeuralHash model. ☆15 · Updated 3 years ago
- Payloads for Attacking Large Language Models ☆56 · Updated 2 months ago
- PhD/MSc course on Machine Learning Security (Univ. Cagliari) ☆190 · Updated this week
- Tools and our test data developed for the HackAPrompt 2023 competition ☆28 · Updated 11 months ago
- The Privacy Adversarial Framework (PAF) is a knowledge base of privacy-focused adversarial tactics and techniques. PAF is heavily inspire… ☆53 · Updated last year
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". ☆81 · Updated 6 months ago
- Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming" ☆26 · Updated 2 months ago
- Gradient-based symbolic execution engine implemented from scratch ☆35 · Updated 9 months ago
- CTF challenges designed and implemented in machine learning applications ☆99 · Updated 3 weeks ago
- ☆22 · Updated this week
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [arXiv, Apr 2024] ☆181 · Updated last month
- Red-teaming LLM applications. ☆20 · Updated 2 months ago
- ☆10 · Updated 2 months ago
- Code to break Llama Guard ☆27 · Updated 9 months ago
- Tree of Attacks (TAP) Jailbreaking Implementation ☆88 · Updated 7 months ago
- The jailbreak-evaluation is an easy-to-use Python package for language model jailbreak evaluation. ☆19 · Updated 2 weeks ago
- ☆91 · Updated last month
- Simple Model Similarities Analysis ☆20 · Updated 7 months ago
- ComPromptMized: Unleashing Zero-click Worms that Target GenAI-Powered Applications ☆189 · Updated 6 months ago
- LLM security and privacy ☆38 · Updated 5 months ago
- Reversal Curse Experiment ☆13 · Updated 11 months ago