Fish-and-Sheep / Text-FluoroscopyLinks

☆13

Alternatives and similar repositories for Text-Fluoroscopy

Users that are interested in Text-Fluoroscopy are comparing it to the libraries listed below

Sorting:

ltroin / llm_attack_defense_arena
☆82Updated last year
WUSTL-CSPL / LLMJailbreak
☆34Updated 8 months ago
mengtong0110 / InferDPT
☆29Updated 2 months ago
lancopku / codable-watermarking-for-llm
Repository for Towards Codable Watermarking for Large Language Models
☆37Updated last year
bangawayoo / mb-lm-watermarking
multi-bit language model watermarking (NAACL 24)
☆13Updated 9 months ago
Lyz1213 / BadEdit
☆28Updated 8 months ago
NY1024 / Foundation-Model-Paper-Notes
☆55Updated last month
ThuCCSLab / FigStep
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆148Updated this week
AI45Lab / ActorAttack
☆88Updated 4 months ago
YancyKahn / CoA
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
☆33Updated 5 months ago
THU-BPM / Robust_Watermark
Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.
☆32Updated 7 months ago
mignonjia / TS_watermark
☆18Updated last month
theshi-1128 / jailbreak-bench
The most comprehensive and accurate LLM jailbreak attack benchmark by far
☆19Updated 3 months ago
Allen-piexl / JailbreakZoo
☆139Updated 9 months ago
roywang021 / UMK
Code for ACM MM2024 paper: White-box Multimodal Jailbreaks Against Large Vision-Language Models
☆28Updated 5 months ago
OSU-NLP-Group / AgentSafety
☆88Updated last month
ShiJiawenwen / JudgeDeceiver
[CCS 2024] Optimization-based Prompt Injection Attack to LLM-as-a-Judge
☆25Updated 7 months ago
THU-KEG / WaterBench
[ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of LLM Watermarks
☆26Updated last year
bangawayoo / nlp-watermarking
Robust natural language watermarking using invariant features
☆25Updated last year
bboylyg / BackdoorLLM
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models
☆167Updated this week
abc03570128 / Jailbreaking-Attack-against-Multimodal-Large-Language-Model
☆48Updated 10 months ago
agiresearch / ASB
Agent Security Bench (ASB)
☆89Updated last week
isXinLiu / MM-SafetyBench
Accepted by ECCV 2024
☆139Updated 8 months ago
grasses / PoisonPrompt
Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107
☆17Updated 10 months ago
xingjunm / Awesome-Large-Model-Safety
Safety at Scale: A Comprehensive Survey of Large Model Safety
☆173Updated 4 months ago
BHui97 / PLeak
☆58Updated 6 months ago
xyq7 / GradSafe
Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"
☆57Updated 8 months ago
Kiode / Text_Watermark
Watermarking Text Generated by Black-Box Language Models
☆38Updated last year
abehou / SemStamp
Repo for SemStamp (NAACL2024) and k-SemStamp (ACL2024)
☆20Updated 6 months ago
PurduePAML / DBS
☆16Updated 2 years ago