Official implementation of Visco-Attack (EMNLP 2025 Main). An open-source one-click reproduction script is also provided.
☆30Apr 11, 2026Updated 2 months ago
Alternatives and similar repositories for Visco-Attack
Users that are interested in Visco-Attack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of “Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models” (AAAI 2026).☆37Mar 22, 2026Updated 3 months ago
- Diagnostic Framework for LLMs and MLLMs☆39Mar 2, 2026Updated 4 months ago
- Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models☆32Oct 6, 2025Updated 8 months ago
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆62Jul 21, 2025Updated 11 months ago
- ☆30May 22, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Repository for the work of the CoSAI Technical Steering Committee (TSC)☆25Jun 25, 2026Updated last week
- A lifecycle guard skill.☆179Mar 27, 2026Updated 3 months ago
- Code for the paper "Jailbreak Large Vision-Language Models Through Multi-Modal Linkage"☆35Dec 6, 2024Updated last year
- ☆11Oct 25, 2024Updated last year
- The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…☆79Jan 16, 2025Updated last year
- Code implementation for paper "Can Large Language Models Empower Molecular Property Prediction?"☆39Jul 14, 2023Updated 2 years ago
- Official repository of Siggraph Asia 2025 paper "LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representa…☆26Dec 24, 2025Updated 6 months ago
- Universal preflight security scanner for AI coding agents — Detects hooks injection, credential exfiltration & backdoors in .cursorrules,…☆75May 29, 2026Updated last month
- Research on "Many-Shot Jailbreaking" in Large Language Models (LLMs). It unveils a novel technique capable of bypassing the safety mechan…☆16Aug 6, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆14Apr 6, 2025Updated last year
- ☆16Oct 18, 2023Updated 2 years ago
- [AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".☆24Mar 18, 2026Updated 3 months ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated 2 years ago
- Code of paper "AdvReverb: AdvReverb: Rethinking the Stealthiness of Audio Adversarial Examples to Human Perception"☆21Nov 26, 2023Updated 2 years ago
- 北京邮电大学生存指南,从沙河到本部,从入学到毕业的全程陪伴☆48Updated this week
- The reinforcement learning codes for dataset SPA-VL☆47Jun 24, 2024Updated 2 years ago
- 【ACL 2024】 SALAD benchmark & MD-Judge☆176Mar 8, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆29Jul 16, 2024Updated last year
- The official code of our CVPR 2024 paper, "3D Human Pose Perception from Egocentric Stereo Videos".☆28Dec 12, 2025Updated 6 months ago
- ☆25Mar 9, 2025Updated last year
- ☆12Mar 24, 2023Updated 3 years ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…☆212Mar 4, 2026Updated 4 months ago
- ☆13Jan 25, 2025Updated last year
- zotero-pdf2zh 的 Homebrew 安装脚本,让你可以轻松在本地部署 Zotero PDF 翻译服务器☆58May 11, 2026Updated last month
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- ☆56Dec 7, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official code for PLoP☆20Mar 6, 2026Updated 3 months ago
- Linux进程间通信(消息队列/信号量+共享内存)☆19Jun 8, 2018Updated 8 years ago
- REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective☆24Feb 28, 2025Updated last year
- Code for ICCV2025 paper——IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves☆18Jul 11, 2025Updated 11 months ago
- 🧨 TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?☆80Nov 27, 2025Updated 7 months ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated last year
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆58Aug 24, 2025Updated 10 months ago