Kekkodf / pypantera
A Python Package for NLP obfuscation using Differential Privacy
☆20Updated last week
Related projects ⓘ
Alternatives and complementary repositories for pypantera
- Code for Findings of ACL 2021 "Differential Privacy for Text Analytics via Natural Text Sanitization"☆26Updated 2 years ago
- ☆66Updated 2 years ago
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…☆86Updated 3 months ago
- Code for Findings-ACL 2023 paper: Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Rec…☆41Updated 5 months ago
- Differentially-private transformers using HuggingFace and Opacus☆124Updated 2 months ago
- A survey of privacy problems in Large Language Models (LLMs). Contains summary of the corresponding paper along with relevant code☆63Updated 5 months ago
- A codebase that makes differentially private training of transformers easy.☆160Updated last year
- ☆22Updated 11 months ago
- ☆18Updated 3 years ago
- Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"☆14Updated 4 months ago
- Awesome LLM Jailbreak academic papers☆77Updated last year
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…☆34Updated 11 months ago
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer☆32Updated 5 months ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23Updated last year
- Official Repo of ACL 2024 Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`☆45Updated 3 weeks ago
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆29Updated 3 years ago
- Training data extraction on GPT-2☆177Updated last year
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆27Updated 2 years ago
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment☆80Updated 8 months ago
- ☆24Updated 3 months ago
- ☆12Updated last year
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆19Updated 7 months ago
- ☆10Updated 3 months ago
- DP-BART for Privatized Text Rewriting under Local Differential Privacy☆14Updated 3 weeks ago
- The code for paper "The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)", exploring the privacy risk o…☆36Updated 8 months ago
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models☆94Updated last month
- A collection of automated evaluators for assessing jailbreak attempts.☆75Updated 4 months ago
- Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversaria…☆34Updated last year
- ☆14Updated 3 years ago
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Updated 7 months ago