NY1024/Foundation-Model-Paper-Notes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NY1024/Foundation-Model-Paper-Notes)

NY1024 / Foundation-Model-Paper-Notes

☆78

Alternatives and similar repositories for Foundation-Model-Paper-Notes

Users that are interested in Foundation-Model-Paper-Notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

whdii / TMM
View on GitHub
☆21Jan 15, 2024Updated 2 years ago
SensenGao / VLPTransferAttack
View on GitHub
[ECCV2024] Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajector…
☆32Nov 15, 2025Updated 8 months ago
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
View on GitHub
☆61Jun 5, 2024Updated 2 years ago
ffhibnese / CPGC_VLP_Universal_Attacks
View on GitHub
[ICCV-2025] Universal Adversarial Attack, Multimodal Adversarial Attacks, VLP models, Contrastive Learning, Cross-modal Perturbation Gene…
☆36Jul 10, 2025Updated last year
sail-sg / AnyDoor
View on GitHub
AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models
☆61Apr 8, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
itsvaibhav01 / Immune
View on GitHub
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
☆28Jun 11, 2025Updated last year
ydc123 / MMP-Attack
View on GitHub
Official repository for "On the Multi-modal Vulnerability of Diffusion Models"
☆17Jul 15, 2024Updated 2 years ago
lancopku / agent-backdoor-attacks
View on GitHub
Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]
☆115Sep 27, 2024Updated last year
liudaizong / Awesome-LVLM-Attack
View on GitHub
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
☆567Updated this week
SilverRAN / Adversarial-Attack-Methods-Summary
View on GitHub
Awesome-Adversarial-Attack-Methods-Summary
☆13Jul 24, 2024Updated last year
TreeLLi / APT
View on GitHub
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models
☆60Apr 25, 2026Updated 2 months ago
ydyjya / Awesome-LLM-Safety
View on GitHub
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide…
☆1,889Jul 12, 2026Updated last week
UCSC-VLAA / vllm-safety-benchmark
View on GitHub
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
☆89Nov 28, 2023Updated 2 years ago
rrgeorge-pdcontributions / NSFW-Words-List
View on GitHub
Text file containing NSFW words aggregated from various sources.
☆12Aug 23, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Zoky-2020 / SGA
View on GitHub
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models. [ICCV 2023 Oral]
☆70Sep 6, 2023Updated 2 years ago
zhaoyiran924 / Probe-Sampling
View on GitHub
[NeurIPS 2024] Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling
☆35Nov 8, 2024Updated last year
lvpeizhuo / MEA-Defender
View on GitHub
This is the source code for MEA-Defender. Our paper is accepted by the IEEE Symposium on Security and Privacy (S&P) 2024.
☆29Nov 19, 2023Updated 2 years ago
Junjie-Ye / ToolSword
View on GitHub
[ACL 2024] ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages
☆15Sep 12, 2024Updated last year
Haochen-Luo / CroPA
View on GitHub
☆56Dec 7, 2024Updated last year
Unispac / Visual-Adversarial-Examples-Jailbreak-Large-Language-Models
View on GitHub
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
☆281May 13, 2024Updated 2 years ago
NicerWang / Joint-GCG
View on GitHub
[AAAI 2026] Official implementation of "Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems".
☆18Mar 23, 2026Updated 3 months ago
abc03570128 / Jailbreaking-Attack-against-Multimodal-Large-Language-Model
View on GitHub
☆63Aug 11, 2024Updated last year
multimodalpragmatic / multimodalpragmatic
View on GitHub
☆14Jan 14, 2026Updated 6 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
erfanshayegani / Jailbreak-In-Pieces
View on GitHub
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…
☆93Jun 6, 2024Updated 2 years ago
sduzpf / UAP_VLP
View on GitHub
Universal Adversarial Perturbations for Vision-Language Pre-trained Models
☆24Aug 8, 2025Updated 11 months ago
wonderNefelibata / Awesome-LRM-Safety
View on GitHub
Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as …
☆84Updated this week
datar001 / Revealing-Vulnerabilities-in-Stable-Diffusion-via-Targeted-Attacks
View on GitHub
☆11Sep 10, 2024Updated last year
BHui97 / PLeak
View on GitHub
☆82Dec 19, 2024Updated last year
datar001 / Awesome-AD-on-T2IDM
View on GitHub
A collection of resources on attacks and defenses targeting text-to-image diffusion models
☆101Dec 20, 2025Updated 7 months ago
thu-ml / Attack-Bard
View on GitHub
☆108Feb 16, 2024Updated 2 years ago
jiamingzhang94 / AnyAttack
View on GitHub
CVPR 2025 - Anyattack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models
☆74Aug 7, 2025Updated 11 months ago
konpanousis / Adversarial-LWTA-AutoAttack
View on GitHub
☆12May 6, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
euanong / image-hijacks
View on GitHub
Official codebase for Image Hijacks: Adversarial Images can Control Generative Models at Runtime
☆56Sep 19, 2023Updated 2 years ago
yxoh / prompt_leak_usenix2024
View on GitHub
☆15May 5, 2026Updated 2 months ago
facebookresearch / text-adversarial-attack
View on GitHub
Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"
☆113Dec 28, 2022Updated 3 years ago
CryptoAILab / Awesome-LM-SSP
View on GitHub
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
☆2,019Jun 17, 2026Updated last month
RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
View on GitHub
Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
☆37Jun 1, 2025Updated last year
real-absolute-AI / Unnatural_Language
View on GitHub
The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'
☆24May 20, 2025Updated last year
yuhongwei22 / MFA
View on GitHub
(AAAI 24) Step Vulnerability Guided Mean Fluctuation Adversarial Attack against Conditional Diffusion Models
☆11Oct 12, 2024Updated last year