jyhong836 / llm-dp-finetune
End-to-end codebase for fine-tuning LLMs (LLaMA 2, 3, etc.) with or without differential privacy (DP)
☆14 · Updated last year
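For context, the core mechanism behind DP fine-tuning (DP-SGD) is per-example gradient clipping followed by calibrated Gaussian noise. The sketch below shows one such step in plain PyTorch; it is illustrative only and makes no claims about llm-dp-finetune's actual implementation (the function name and hyperparameters are assumptions). Production code would more likely rely on a DP library such as Opacus.

```python
# A minimal, illustrative sketch of one DP-SGD step in plain PyTorch.
# NOT llm-dp-finetune's actual code: the function name and the
# hyperparameters (max_grad_norm, noise_multiplier) are assumptions.
import torch

def dp_sgd_step(model, loss_fn, inputs, targets, optimizer,
                max_grad_norm=1.0, noise_multiplier=1.0):
    """Clip each example's gradient, sum, add Gaussian noise, then average."""
    params = [p for p in model.parameters() if p.requires_grad]
    summed = [torch.zeros_like(p) for p in params]
    for x, y in zip(inputs, targets):
        loss = loss_fn(model(x.unsqueeze(0)), y.unsqueeze(0))
        grads = torch.autograd.grad(loss, params)
        # Per-example clipping: rescale so the total gradient norm <= max_grad_norm.
        norm = torch.sqrt(sum(g.pow(2).sum() for g in grads)).item()
        scale = min(1.0, max_grad_norm / (norm + 1e-12))
        for s, g in zip(summed, grads):
            s.add_(g, alpha=scale)
    n = len(inputs)
    for p, s in zip(params, summed):
        # Gaussian noise calibrated to the clipping bound, then averaged.
        noise = torch.randn_like(s) * (noise_multiplier * max_grad_norm)
        p.grad = (s + noise) / n
    optimizer.step()
    optimizer.zero_grad()
```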
Alternatives and similar repositories for llm-dp-finetune
Users interested in llm-dp-finetune are comparing it to the repositories listed below.
- GitHub repo for the NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models" ☆23 · Updated 2 months ago
- A toolkit to assess data privacy in LLMs (under development) ☆65 · Updated 11 months ago
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer ☆46 · Updated last year
- ☆24 · Updated last year
- Official repo for the paper "Recovering Private Text in Federated Learning of Language Models" (NeurIPS 2022) ☆61 · Updated 2 years ago
- Implementation of the paper "Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing" ☆21 · Updated last year
- Private Adaptive Optimization with Side Information (ICML '22) ☆16 · Updated 3 years ago
- ☆23 · Updated last year
- ☆46 · Updated last year
- Official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries ☆54 · Updated last month
- ☆48 · Updated 10 months ago
- Code for the paper "The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)", exploring the privacy risk o… ☆62 · Updated 10 months ago
- ☆70 · Updated 10 months ago
- ☆22 · Updated 4 months ago
- Official code for the ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis" ☆60 · Updated last year
- Code & data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024] ☆102 · Updated last year
- ☆77 · Updated 3 years ago
- ☆28 · Updated 2 years ago
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization" ☆76 · Updated 4 months ago
- Code for the paper "Universal Jailbreak Backdoors from Poisoned Human Feedback" ☆66 · Updated last year
- ☆114 · Updated 2 years ago
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion ☆55 · Updated 2 months ago
- ☆24 · Updated 3 years ago
- Official repository for "Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks" ☆59 · Updated last year
- ☆24 · Updated 2 years ago
- A survey of privacy problems in Large Language Models (LLMs). Contains summaries of the corresponding papers along with relevant code ☆68 · Updated last year
- Official code for "Understanding Deep Gradient Leakage via Inversion Influence Functions", NeurIPS 2023 ☆16 · Updated 2 years ago
- ☆32 · Updated 3 years ago
- The repository contains the code for analysing the leakage of personally identifiable information (PII) from the output of next word pred… ☆101 · Updated last year
- A curated list of trustworthy Generative AI papers, updated daily ☆75 · Updated last year