π€« Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory"
β53Dec 20, 2023Updated 2 years ago
Alternatives and similar repositories for confaide
Users that are interested in confaide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"β20May 27, 2024Updated last year
- Official repository of the paper: Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code (Findings of EACL β¦β12Mar 26, 2026Updated 2 weeks ago
- π€ Code for our EMNLP 2020 paper: "Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness"β37Oct 12, 2020Updated 5 years ago
- π» Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"β59May 31, 2024Updated last year
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888β37Jun 10, 2024Updated last year
- NordVPN Threat Protection Proβ’ β’ AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Dataseβ¦β13Jun 24, 2024Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Lβ¦β45Jun 13, 2023Updated 2 years ago
- β28Nov 28, 2023Updated 2 years ago
- β24Aug 18, 2023Updated 2 years ago
- [Preprint] On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shapingβ10Feb 27, 2020Updated 6 years ago
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineerβ47May 30, 2024Updated last year
- Code for "CloudLeak: Large-Scale Deep Learning Models Stealing Through Adversarial Examples" (NDSS 2020)β22Nov 14, 2020Updated 5 years ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidβ¦β23May 8, 2023Updated 2 years ago
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word predβ¦β104Aug 13, 2024Updated last year
- NordVPN Special Discount Offer β’ AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Private Adaptive Optimization with Side Information (ICML '22)β16Jun 23, 2022Updated 3 years ago
- β16Nov 30, 2022Updated 3 years ago
- β27Sep 15, 2024Updated last year
- β16Jan 4, 2022Updated 4 years ago
- β21Apr 3, 2026Updated last week
- Code for Findings of ACL 2021 "Differential Privacy for Text Analytics via Natural Text Sanitization"β32Mar 15, 2022Updated 4 years ago
- Research simulation toolkit for federated learningβ13Nov 7, 2020Updated 5 years ago
- Official code and dataset repository of KoBBQ (TACL 2024)β19May 13, 2024Updated last year
- β20Oct 28, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Hide and Seek (HaS): A Framework for Prompt Privacy Protectionβ54Sep 6, 2023Updated 2 years ago
- NLPCC-2025 Shared-Task 1: LLM-Generated Text Detectionβ15Updated this week
- The official repository of the paper "On the Exploitability of Instruction Tuning".β69Feb 5, 2024Updated 2 years ago
- β19Mar 6, 2023Updated 3 years ago
- Machine learning project using federated learning for text generationβ11May 5, 2024Updated last year
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QAβ16May 11, 2022Updated 3 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"β32Sep 13, 2024Updated last year
- Bayesian Active Learning with Fully Bayesian Gaussian Processesβ14Sep 29, 2022Updated 3 years ago
- π§π» Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playingβ¦β21Dec 20, 2024Updated last year
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- β21Sep 21, 2021Updated 4 years ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generationβ14Aug 19, 2025Updated 7 months ago
- πΈ Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"β22Sep 5, 2023Updated 2 years ago
- Implementation of Self-supervised-Online-Adversarial-Purificationβ13Aug 2, 2021Updated 4 years ago
- π€ Code for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"β76Mar 22, 2022Updated 4 years ago
- Flow Integrity Deterministic Enforcement System. Mechanisms for securing AI agents with information-flow control.β85May 30, 2025Updated 10 months ago
- β11Jul 7, 2023Updated 2 years ago