Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".
☆59Apr 1, 2026Updated last week
Alternatives and similar repositories for Meta_SecAlign
Users that are interested in Meta_SecAlign are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of the WASP web agent security benchmark☆79Aug 12, 2025Updated 7 months ago
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"☆94Jul 24, 2025Updated 8 months ago
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents☆18Sep 16, 2025Updated 6 months ago
- pytorch reimplementation for Detecting Adversarial Examples from Sensitivity Inconsistency of Spatial-Transform Domain☆11Oct 30, 2022Updated 3 years ago
- ☆41Jul 19, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.☆20Dec 6, 2024Updated last year
- ☆55Mar 18, 2026Updated 3 weeks ago
- Website & Documentation: https://sbaresearch.github.io/model-watermarking/☆25Sep 22, 2023Updated 2 years ago
- Dataset and evaluation benchmark for Privacy Leakage Evaluation of Autonomous Web Agents☆37Mar 26, 2026Updated 2 weeks ago
- Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"☆70Jan 19, 2026Updated 2 months ago
- Adversarial Examples Detection Benchmark☆17Dec 6, 2024Updated last year
- [NeurIPS'24] Protecting Your LLMs with Information Bottleneck☆27Nov 7, 2024Updated last year
- Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks☆38May 25, 2021Updated 4 years ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆51Mar 25, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.☆515Mar 30, 2026Updated last week
- 🧨 TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?☆74Nov 27, 2025Updated 4 months ago
- The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".☆70Oct 23, 2024Updated last year
- WAFFLE: Watermarking in Federated Learning☆23Aug 21, 2023Updated 2 years ago
- A research workbench for developing and testing attacks against large language models, with a focus on prompt injection vulnerabilities a…☆47Apr 2, 2026Updated last week
- [ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments☆47Feb 9, 2026Updated 2 months ago
- ☆52May 24, 2023Updated 2 years ago
- ☆37Oct 2, 2024Updated last year
- ☆20Feb 3, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This repository contains all the notes I took in the learning process of all the technologies during my study! 这个仓库记录了我在本科期间学习各类技术的过程中记录…☆21Mar 14, 2023Updated 3 years ago
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- ☆24Apr 14, 2019Updated 6 years ago
- Official implementation of AdvPrompter https//arxiv.org/abs/2404.16873☆181May 6, 2024Updated last year
- ☆15Feb 11, 2025Updated last year
- [ICCV 2023] "TRM-UAP: Enhancing the Transferability of Data-Free Universal Adversarial Perturbation via Truncated Ratio Maximization", Yi…☆13Jul 17, 2024Updated last year
- [TDSC 2024] Official code for our paper "FedTracker: Furnishing Ownership Verification and Traceability for Federated Learning Model"☆23May 14, 2025Updated 10 months ago
- Large language model of Medical AI, General Medical AI (GMAI)☆17Jan 30, 2024Updated 2 years ago
- ☆25Mar 26, 2026Updated 2 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆129Jul 2, 2024Updated last year
- Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"☆111Dec 28, 2022Updated 3 years ago
- ☆10Jun 1, 2022Updated 3 years ago
- Enemies for your LLM☆35Jan 20, 2026Updated 2 months ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- High performance SRMD implementation using CUDA.☆28Mar 28, 2023Updated 3 years ago
- reddit's python experiments framework☆12Apr 28, 2025Updated 11 months ago