alipay / private_llmLinks
☆33Updated last year
Alternatives and similar repositories for private_llm
Users that are interested in private_llm are comparing it to the libraries listed below
Sorting:
- Shepherd: A foundational framework enabling federated instruction tuning for large language models☆232Updated last year
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Updated 2 years ago
- Implementation for PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs☆22Updated last year
- ☆18Updated last year
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆76Updated 3 weeks ago
- ☆21Updated last year
- Official repo for the paper: Recovering Private Text in Federated Learning of Language Models (in NeurIPS 2022)☆56Updated 2 years ago
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer☆43Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- LAMP: Extracting Text from Gradients with Language Model Priors (NeurIPS '22)☆23Updated last week
- ☆49Updated last year
- Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers☆52Updated 9 months ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888☆35Updated 11 months ago
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).☆142Updated last year
- TrustAgent: Towards Safe and Trustworthy LLM-based Agents☆41Updated 4 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆93Updated last year
- ☆40Updated 2 months ago
- Codebase for decoding compressed trust.☆23Updated last year
- Federated Learning Framework Benchmark (UniFed)☆49Updated last year
- ☆20Updated last year
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23Updated 2 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆29Updated 2 years ago
- ☆26Updated last year
- [ICML 2025] Weak-to-Strong Jailbreaking on Large Language Models☆76Updated last month
- ☆12Updated 2 years ago
- Federated Few-shot Learning for Mobile NLP. Conditionally accepted by MobiCom'23.☆16Updated last year
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)☆61Updated 4 months ago
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey☆98Updated 10 months ago
- ☆40Updated 2 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆80Updated last year