alipay / private_llm
☆35 Updated last year
Alternatives and similar repositories for private_llm
Users that are interested in private_llm are comparing it to the libraries listed below
- Shepherd: A foundational framework enabling federated instruction tuning for large language models ☆246 Updated 2 years ago
- Federated Learning for LLMs. ☆233 Updated 10 months ago
- Hide and Seek (HaS): A Framework for Prompt Privacy Protection ☆48 Updated 2 years ago
- FedJudge: Federated Legal Large Language Model ☆36 Updated last year
- The official implementation of the paper "Does Federated Learning Really Need Backpropagation?" ☆23 Updated 2 years ago
- ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings] ☆211 Updated last year
- A MoE impl for PyTorch, [ATC'23] SmartMoE ☆70 Updated 2 years ago
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer ☆46 Updated last year
- Official repo for the paper: Recovering Private Text in Federated Learning of Language Models (in NeurIPS 2022) ☆60 Updated 2 years ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024) ☆89 Updated 4 months ago
- Implementation for PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs ☆24 Updated last year
- A survey of privacy problems in Large Language Models (LLMs). Contains summary of the corresponding paper along with relevant code ☆68 Updated last year
- ☆160 Updated 8 months ago
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs). ☆160 Updated last year
- ☆23 Updated last year
- RLHF experiments on a single A100 40G GPU. Supports PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, and DeepSeek R1-Zero reproduction. ☆73 Updated 7 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models. ☆250 Updated 11 months ago
- We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20… ☆323 Updated last year
- A curated list of Model Merging methods. ☆92 Updated last year
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji… ☆233 Updated last year
- The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source… ☆63 Updated 8 months ago
- Implementation of Implicit Knowledge Extraction Attack. ☆15 Updated 4 months ago
- Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs. Empirical tricks for LLM Jailbreaking. (NeurIPS 2024) ☆149 Updated 10 months ago
- S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language Models ☆98 Updated 3 months ago
- UP-TO-DATE LLM Watermark paper. 🔥🔥🔥 ☆357 Updated 10 months ago
- On Memorization of Large Language Models in Logical Reasoning ☆72 Updated 6 months ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888 ☆36 Updated last year
- ☆31 Updated last year
- ☆115 Updated last year
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models ☆211 Updated last year