awisiorek / syntax-2223Links
Materialien für die Vorlesung "Syntax natürlicher Sprachen" im WS 2022/23 (CIS, LMU München)
☆9Updated 2 years ago
Alternatives and similar repositories for syntax-2223
Users that are interested in syntax-2223 are comparing it to the libraries listed below
Sorting:
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆12Updated last year
- ☆18Updated 4 years ago
- Natural Language Attacks in a Hard Label Black Box Setting.☆47Updated 4 years ago
- Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021☆43Updated 4 years ago
- ☆74Updated 3 years ago
- An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)☆184Updated 2 years ago
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆34Updated last year
- Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)☆24Updated 3 years ago
- Natural Universal Trigger Search (NUTS)☆21Updated 4 years ago
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization☆27Updated last year
- Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"☆43Updated 2 years ago
- A lightweight library for large laguage model (LLM) jailbreaking defense.☆53Updated 9 months ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆18Updated 2 years ago
- Interpretable LambdaMART☆11Updated 2 months ago
- A framework for assessing and improving classification fairness.☆33Updated 2 years ago
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156☆35Updated last year
- A Unified Approach to Evaluate and Compare Explainable AI methods☆14Updated last year
- ☆140Updated last year
- Code for "Astraea: Grammar-based Fairness Testing"☆10Updated 3 years ago
- AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM☆69Updated 8 months ago
- ☆11Updated 5 years ago
- Imitation Adversarial Attacks for Black-box Neural Ranking Models☆12Updated last year
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"☆43Updated 2 years ago
- An unofficial implementation of AutoDAN attack on LLMs (arXiv:2310.15140)☆42Updated last year
- Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models☆31Updated last month
- codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"☆31Updated last year
- Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"☆107Updated 2 years ago
- Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT☆198Updated 4 years ago
- ☆11Updated 3 years ago
- Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency☆70Updated 2 years ago