awisiorek / syntax-2223Links

Materialien für die Vorlesung "Syntax natürlicher Sprachen" im WS 2022/23 (CIS, LMU München)

☆9

Alternatives and similar repositories for syntax-2223

Users that are interested in syntax-2223 are comparing it to the libraries listed below

Sorting:

AAAAAAsuka / llm_defends
code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"
☆12Updated last year
ShannonAI / backdoor_nlg
☆18Updated 4 years ago
RishabhMaheshwary / hard-label-attack
Natural Language Attacks in a Hard Label Black Box Setting.
☆47Updated 4 years ago
cookielee77 / CLARE
Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021
☆43Updated 4 years ago
huseyinatahaninan / Differentially-Private-Fine-tuning-of-Language-Models
☆74Updated 3 years ago
thunlp / OpenBackdoor
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
☆184Updated 2 years ago
HKUST-KnowComp / LLM-Multistep-Jailbreak
Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT
☆34Updated last year
lancopku / SOS
Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)
☆24Updated 3 years ago
Hsuan-Tung / universal_attack_natural_trigger
Natural Universal Trigger Search (NUTS)
☆21Updated 4 years ago
thu-coai / JailbreakDefense_GoalPriority
[ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
☆27Updated last year
thunlp / StyleAttack
Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"
☆43Updated 2 years ago
YihanWang617 / llm-jailbreaking-defense
A lightweight library for large laguage model (LLM) jailbreaking defense.
☆53Updated 9 months ago
dugu9sword / dne
ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble
☆18Updated 2 years ago
veneres / ilmart
Interpretable LambdaMART
☆11Updated 2 months ago
Libr-AI / fairlib
A framework for assessing and improving classification fairness.
☆33Updated 2 years ago
princeton-nlp / corpus-poisoning
[EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156
☆35Updated last year
Karim-53 / Compare-xAI
A Unified Approach to Evaluate and Compare Explainable AI methods
☆14Updated last year
i-gallegos / Fair-LLM-Benchmark
☆140Updated last year
sakshiudeshi / Astraea
Code for "Astraea: Grammar-based Fairness Testing"
☆10Updated 3 years ago
OSU-NLP-Group / AmpleGCG
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM
☆69Updated 8 months ago
LiKev12 / CSE544T-Project-TextBugger
☆11Updated 5 years ago
LauJames / PAT
Imitation Adversarial Attacks for Black-box Neural Ranking Models
☆12Updated last year
thunlp / HiddenKiller
Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"
☆43Updated 2 years ago
rotaryhammer / code-autodan
An unofficial implementation of AutoDAN attack on LLMs (arXiv:2310.15140)
☆42Updated last year
RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
☆31Updated last month
RockyLzy / TextDefender
codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"
☆31Updated last year
facebookresearch / text-adversarial-attack
Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"
☆107Updated 2 years ago
LinyangLee / BERT-Attack
Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT
☆198Updated 4 years ago
kangjie-chen / BadPre
☆11Updated 3 years ago
JHL-HUST / PWWS
Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency
☆70Updated 2 years ago