π€« Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory"
β50Dec 20, 2023Updated 2 years ago
Alternatives and similar repositories for confaide
Users that are interested in confaide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"β20May 27, 2024Updated last year
- Official repository of the paper: Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code (Findings of EACL β¦β12Feb 11, 2026Updated last month
- π€ Code for our EMNLP 2020 paper: "Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness"β37Oct 12, 2020Updated 5 years ago
- π» Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"β59May 31, 2024Updated last year
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888β37Jun 10, 2024Updated last year
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Dataseβ¦β13Jun 24, 2024Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Lβ¦β45Jun 13, 2023Updated 2 years ago
- β28Nov 28, 2023Updated 2 years ago
- β24Aug 18, 2023Updated 2 years ago
- [Preprint] On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shapingβ10Feb 27, 2020Updated 6 years ago
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineerβ47May 30, 2024Updated last year
- Code for "CloudLeak: Large-Scale Deep Learning Models Stealing Through Adversarial Examples" (NDSS 2020)β22Nov 14, 2020Updated 5 years ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidβ¦β23May 8, 2023Updated 2 years ago
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word predβ¦β104Aug 13, 2024Updated last year
- Documentation atβ14Mar 27, 2025Updated 11 months ago
- Private Adaptive Optimization with Side Information (ICML '22)β16Jun 23, 2022Updated 3 years ago
- β27Sep 15, 2024Updated last year
- β16Jan 4, 2022Updated 4 years ago
- β21Jul 21, 2025Updated 8 months ago
- https://icml.cc/virtual/2023/poster/24354β10Aug 15, 2023Updated 2 years ago
- Code for Findings of ACL 2021 "Differential Privacy for Text Analytics via Natural Text Sanitization"β32Mar 15, 2022Updated 4 years ago
- Official code and dataset repository of KoBBQ (TACL 2024)β19May 13, 2024Updated last year
- The git repository of Modular Prompted Chatbot paperβ35May 24, 2023Updated 2 years ago
- LobotoMl is a set of scripts and tools to assess production deployments of ML servicesβ10May 16, 2022Updated 3 years ago
- The official repository of the paper "On the Exploitability of Instruction Tuning".β70Feb 5, 2024Updated 2 years ago
- NLPCC-2025 Shared-Task 1: LLM-Generated Text Detectionβ15May 19, 2025Updated 10 months ago
- Codes for reproducing the results of the paper "Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness" published at ICβ¦β27Apr 29, 2020Updated 5 years ago
- β27Nov 20, 2023Updated 2 years ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QAβ16May 11, 2022Updated 3 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"β32Sep 13, 2024Updated last year
- Bayesian Active Learning with Fully Bayesian Gaussian Processesβ14Sep 29, 2022Updated 3 years ago
- π§π» Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playingβ¦β21Dec 20, 2024Updated last year
- Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment"β20Feb 10, 2025Updated last year
- Code for paper: "RemovalNet: DNN model fingerprinting removal attack", IEEE TDSC 2023.β10Nov 27, 2023Updated 2 years ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generationβ14Aug 19, 2025Updated 7 months ago
- πΈ Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"β22Sep 5, 2023Updated 2 years ago
- Implementation of Self-supervised-Online-Adversarial-Purificationβ13Aug 2, 2021Updated 4 years ago
- Repo for the paper "Bounding Training Data Reconstruction in Private (Deep) Learning".β11Jun 16, 2023Updated 2 years ago
- [SatML 2024] Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Riskβ15Mar 15, 2025Updated last year