π€« Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory"
β54Dec 20, 2023Updated 2 years ago
Alternatives and similar repositories for confaide
Users that are interested in confaide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"β20May 27, 2024Updated last year
- π€ Code for our EMNLP 2020 paper: "Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness"β37Oct 12, 2020Updated 5 years ago
- π» Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"β61May 31, 2024Updated last year
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888β37Jun 10, 2024Updated last year
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Dataseβ¦β13Jun 24, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Lβ¦β45Jun 13, 2023Updated 2 years ago
- β28Nov 28, 2023Updated 2 years ago
- β24Aug 18, 2023Updated 2 years ago
- [Preprint] On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shapingβ10Feb 27, 2020Updated 6 years ago
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineerβ47May 30, 2024Updated last year
- Code for "CloudLeak: Large-Scale Deep Learning Models Stealing Through Adversarial Examples" (NDSS 2020)β22Nov 14, 2020Updated 5 years ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidβ¦β23May 8, 2023Updated 2 years ago
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word predβ¦β104Aug 13, 2024Updated last year
- Private Adaptive Optimization with Side Information (ICML '22)β16Jun 23, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β16Nov 30, 2022Updated 3 years ago
- β29Sep 15, 2024Updated last year
- β21Apr 3, 2026Updated 3 weeks ago
- https://icml.cc/virtual/2023/poster/24354β10Aug 15, 2023Updated 2 years ago
- Official code and dataset repository of KoBBQ (TACL 2024)β19May 13, 2024Updated last year
- β20Oct 28, 2025Updated 6 months ago
- The git repository of Modular Prompted Chatbot paperβ35May 24, 2023Updated 2 years ago
- Hide and Seek (HaS): A Framework for Prompt Privacy Protectionβ54Sep 6, 2023Updated 2 years ago
- Code for Findings of ACL 2021 "Differential Privacy for Text Analytics via Natural Text Sanitization"β33Mar 15, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The official repository of the paper "On the Exploitability of Instruction Tuning".β69Feb 5, 2024Updated 2 years ago
- β19Mar 6, 2023Updated 3 years ago
- Codes for reproducing the results of the paper "Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness" published at ICβ¦β27Apr 29, 2020Updated 6 years ago
- β27Nov 20, 2023Updated 2 years ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QAβ16May 11, 2022Updated 3 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"β32Sep 13, 2024Updated last year
- Bayesian Active Learning with Fully Bayesian Gaussian Processesβ14Sep 29, 2022Updated 3 years ago
- β21Sep 21, 2021Updated 4 years ago
- Code for paper: "RemovalNet: DNN model fingerprinting removal attack", IEEE TDSC 2023.β10Nov 27, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- πΈ Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"β22Sep 5, 2023Updated 2 years ago
- Implementation of Self-supervised-Online-Adversarial-Purificationβ13Aug 2, 2021Updated 4 years ago
- [SatML 2024] Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Riskβ15Mar 15, 2025Updated last year
- Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment"β22Feb 10, 2025Updated last year
- β13Oct 21, 2021Updated 4 years ago
- Flow Integrity Deterministic Enforcement System. Mechanisms for securing AI agents with information-flow control.β88May 30, 2025Updated 11 months ago
- This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (β¦β14Oct 26, 2023Updated 2 years ago