QinbinLi/LLM-PBE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QinbinLi/LLM-PBE)

QinbinLi / LLM-PBE

A toolkit to assess data privacy in LLMs (under development)

☆75

Alternatives and similar repositories for LLM-PBE

Users that are interested in LLM-PBE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jyhong836 / llm-dp-finetune
View on GitHub
End-to-end codebase for finetuning LLMs (LLaMA 2, 3, etc.) with or without DP
☆17Sep 23, 2024Updated last year
weichen-yu / LM-Extraction
View on GitHub
☆43May 23, 2023Updated 3 years ago
HKUST-KnowComp / PrivLM-Bench
View on GitHub
Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models.
☆16Feb 5, 2025Updated last year
ethz-spylab / unlearning-vs-safety
View on GitHub
☆27Oct 6, 2024Updated last year
microsoft / analysing_pii_leakage
View on GitHub
The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…
☆104Aug 13, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
eth-sri / SynthPAI
View on GitHub
A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)
☆58Jul 27, 2025Updated 11 months ago
zhaojunGUO / Awesome-LLM-Watermark
View on GitHub
Watermarking LLM papers up-to-date
☆12Dec 17, 2023Updated 2 years ago
TangciuYueng / AMemGuard
View on GitHub
☆11Jul 2, 2026Updated 3 weeks ago
zjysteven / mink-plus-plus
View on GitHub
[ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs
☆59May 26, 2025Updated last year
Sripaad / ai4privacy
View on GitHub
☆23Mar 15, 2024Updated 2 years ago
saferlhf-v / saferlhf-v
View on GitHub
☆23Jun 16, 2025Updated last year
GraySwanAI / circuit-breakers
View on GitHub
Improving Alignment and Robustness with Circuit Breakers
☆266Sep 24, 2024Updated last year
Xtra-Computing / OEBench
View on GitHub
OEBench: Investigating Open Environment Challenges in Real-World Relational Data Streams (VLDB 2024)
☆13Aug 27, 2024Updated last year
abertsch72 / long-context-icl
View on GitHub
Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"
☆44Aug 20, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
AI45Lab / Fake-Alignment
View on GitHub
☆17Mar 22, 2024Updated 2 years ago
ziweiji / Self_Reflection_Medical
View on GitHub
Code for paper Towards Mitigating LLM Hallucination via Self Reflection
☆30Oct 9, 2023Updated 2 years ago
boyiwei / alignment-attribution-code
View on GitHub
[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
☆91Mar 30, 2025Updated last year
SchwinnL / circuit-breakers-eval
View on GitHub
Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting
☆18Apr 15, 2025Updated last year
HKUST-KnowComp / PrivaCI-Bench
View on GitHub
☆23Apr 23, 2025Updated last year
VITA-Group / DP-OPT
View on GitHub
[ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer
☆48May 30, 2024Updated 2 years ago
Yijia-Xiao / PrivacyMind
View on GitHub
Large Language Models Can Be Contextual Privacy Protection Learners
☆16Oct 28, 2024Updated last year
Xtra-Computing / PathEnum
View on GitHub
Source code of "PathEnum: Towards Real-Time Hop-Constrained s-t Path Enumeration", published in SIGMOD'2021 - By Shixuan Sun, Yuhang Chen…
☆17Mar 23, 2021Updated 5 years ago
Xtra-Computing / ThundeRiNG
View on GitHub
Fast Multiple Independent Random Number Sequences Generation on FPGAs
☆15Sep 19, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
decoding-comp-trust / comp-trust
View on GitHub
Codebase for decoding compressed trust.
☆27May 7, 2024Updated 2 years ago
imperial-aisp / mia_llms_benchmark
View on GitHub
Benchmarking MIAs against LLMs.
☆30Oct 8, 2024Updated last year
SaFo-Lab / DRIFT
View on GitHub
[NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agen…
☆58Jul 16, 2026Updated last week
Unispac / shallow-vs-deep-alignment
View on GitHub
Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep
☆190Apr 23, 2025Updated last year
CryptoAILab / JailbreakEval
View on GitHub
[NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.
☆193Apr 1, 2025Updated last year
dreadnode / research
View on GitHub
General research for Dreadnode
☆29Jun 17, 2024Updated 2 years ago
thu-coai / TransferAttack
View on GitHub
[ACL 2025] Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
☆19May 23, 2025Updated last year
Xtra-Computing / VertiBench
View on GitHub
Feature partitioner by imbalance or correlation (ICLR 2024)
☆17Mar 25, 2026Updated 4 months ago
ethz-spylab / agentdojo
View on GitHub
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆684Jun 2, 2026Updated last month
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
usail-hkust / JailTrickBench
View on GitHub
Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs. Empirical tricks for LLM Jailbreaking. (NeurIPS 2024)
☆167Nov 30, 2024Updated last year
haidequanbu / ESC-Eval
View on GitHub
[EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“
☆27Jun 24, 2024Updated 2 years ago
wyshi / sdp_transformers
View on GitHub
☆12Jan 5, 2023Updated 3 years ago
javiferran / sae_entities
View on GitHub
☆78Mar 6, 2025Updated last year
CGCL-codes / TransferAttackSurrogates
View on GitHub
The official code of IEEE S&P 2024 paper "Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial Transferabili…
☆20Aug 22, 2024Updated last year
facebookresearch / ai-agent-privacy
View on GitHub
Dataset and evaluation benchmark for Privacy Leakage Evaluation of Autonomous Web Agents
☆45Apr 18, 2026Updated 3 months ago
Confirm-Solutions / flrt
View on GitHub
Fluent student-teacher redteaming
☆23Jul 25, 2024Updated 2 years ago