☆21 May 23, 2025 Updated 9 months ago
Alternatives and similar repositories for llm-anonymization
Users who are interested in llm-anonymization are comparing it to the libraries listed below.
- ☆71 Feb 16, 2025 Updated last year
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks ☆20 Sep 18, 2025 Updated 5 months ago
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models" ☆15 Dec 16, 2025 Updated 2 months ago
- ☆23 Oct 25, 2024 Updated last year
- PoliGraph: Automated Privacy Policy Analysis using Knowledge Graphs ☆33 Jun 21, 2023 Updated 2 years ago
- Code for Voice Jailbreak Attacks Against GPT-4o. ☆36 May 31, 2024 Updated last year
- ☆37 Nov 16, 2025 Updated 3 months ago
- ☆37 Oct 17, 2024 Updated last year
- ☆12 May 6, 2022 Updated 3 years ago
- This repo is for the safety topic, including attacks, defenses and studies related to reasoning and RL ☆61 Sep 5, 2025 Updated 6 months ago
- On the Robustness of GUI Grounding Models Against Image Attacks ☆12 Apr 8, 2025 Updated 10 months ago
- BrainWash: A Poisoning Attack to Forget in Continual Learning ☆12 Apr 15, 2024 Updated last year
- Training project about Deep Learning ☆12 Jun 22, 2017 Updated 8 years ago
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns ☆13 Mar 1, 2025 Updated last year
- Official Repository for Dataset Inference for LLMs ☆42 Jul 25, 2024 Updated last year
- A Survey on Learning to Hash ☆10 Apr 10, 2018 Updated 7 years ago
- A research workbench for developing and testing attacks against large language models, with a focus on prompt injection vulnerabilities a… ☆39 Updated this week
- ☆20 Feb 3, 2025 Updated last year
- ☆14 Feb 26, 2025 Updated last year
- LobotoMl is a set of scripts and tools to assess production deployments of ML services ☆10 May 16, 2022 Updated 3 years ago
- Official Code Implementation for the CCS 2022 Paper "On the Privacy Risks of Cell-Based NAS Architectures" ☆11 Nov 21, 2022 Updated 3 years ago
- Code for running forward and backward versions of GPT2 ☆10 Nov 20, 2021 Updated 4 years ago
- ☆11 Dec 8, 2024 Updated last year
- The implementation of our IEEE S&P 2024 paper "Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples". ☆11 Jun 28, 2024 Updated last year
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models. ☆13 Dec 16, 2024 Updated last year
- A tool for extracting, modifying, and crafting ASDM binary packages (CVE-2022-20829) ☆13 Aug 15, 2022 Updated 3 years ago
- Javascript Trie experiment 𐄁 ☎️ 𐄁 old school "T9" word prediction ☆13 Jun 11, 2018 Updated 7 years ago
- [ICLR 2022] Boosting Randomized Smoothing with Variance Reduced Classifiers ☆12 Mar 29, 2022 Updated 3 years ago
- ☆23 Aug 30, 2025 Updated 6 months ago
- ☆14 Mar 9, 2025 Updated 11 months ago
- Code for AISTATS'25 paper - On the Power of Adaptive Weighted Aggregation in Heterogeneous Federated Learning and Beyond ☆13 Sep 23, 2025 Updated 5 months ago
- Official Implementation of implicit reference attack ☆11 Oct 16, 2024 Updated last year
- ☆13 Sep 1, 2025 Updated 6 months ago
- [Preprint] Backdoor Attacks on Federated Learning with Lottery Ticket Hypothesis ☆10 Sep 23, 2021 Updated 4 years ago
- ☆22 Jun 22, 2025 Updated 8 months ago
- ☆10 Jun 16, 2022 Updated 3 years ago
- [NeurIPS 2024] "Membership Inference on Text-to-image Diffusion Models via Conditional Likelihood Discrepancy" ☆12 Sep 15, 2025 Updated 5 months ago
- ☆11 May 18, 2025 Updated 9 months ago
- PyTorch Implementation of Weakly Supervised Pre-training - [IJCAI19] ☆12 May 23, 2020 Updated 5 years ago