A data construction and evaluation framework to quantify privacy norm awareness of language models (LMs) and emerging privacy risk of LM agents. (NeurIPS 2024 D&B)
☆43Mar 4, 2025Updated last year
Alternatives and similar repositories for PrivacyLens
Users that are interested in PrivacyLens are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- use angr to deobfuscation☆10Oct 8, 2019Updated 6 years ago
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use☆192Mar 22, 2024Updated 2 years ago
- Code and datasets for the salesforce AI research paper on prompt leakage and multi-turn threats against LLMs☆21Nov 10, 2025Updated 4 months ago
- Bayesian Logistic Regression with Hyper-LASSO priors☆10Dec 14, 2025Updated 3 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆124Dec 4, 2025Updated 3 months ago
- Code for ECML-PKDD 2022 Paper --- CMG: A Class-Mixed Generation Approach to Out-of-Distribution Detection☆12Oct 12, 2022Updated 3 years ago
- Repository about single/multi-agent, robotics, llm/vlm/vla, scientific discovery, etc.☆19Jul 10, 2025Updated 8 months ago
- Official repository for the paper, "FedMABench: Benchmarking Mobile GUI Agents on Decentralized Heterogeneous User Data", EMNLP 2025 Main…☆16Nov 11, 2025Updated 4 months ago
- Easily identify and label sentence intervals using various taggers.☆16Feb 1, 2017Updated 9 years ago
- EmojiCrypt: Prompt Encryption for Secure Communication with Large Language Models☆23Feb 21, 2024Updated 2 years ago
- Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"☆33Mar 9, 2026Updated 2 weeks ago
- [ICCV 25] Official repository of "Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dial…☆25Dec 6, 2025Updated 3 months ago
- ☆10Mar 9, 2023Updated 3 years ago
- Python implementation of Support Vector Machine (SVM) classifier☆11Oct 11, 2020Updated 5 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆36Oct 15, 2023Updated 2 years ago
- [ACL 2024] Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications☆14May 24, 2024Updated last year
- ☆43Mar 3, 2026Updated 3 weeks ago
- ☆15May 17, 2022Updated 3 years ago
- Python benchmark tool inspired by Geekbench.☆20Feb 21, 2026Updated last month
- ☆25Feb 18, 2026Updated last month
- ☆19May 3, 2025Updated 10 months ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Jan 9, 2024Updated 2 years ago
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 9 months ago
- [ICML 2025] Logits are All We Need to Adapt Closed Models☆21May 2, 2025Updated 10 months ago
- IEEE TNNLS | GeSeNet: A General Semantic-guided Network with Couple Mask Ensemble for Medical Image Fusion☆21Aug 9, 2023Updated 2 years ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆99Jan 11, 2026Updated 2 months ago
- Code for SIGKDD'2021 paper: Deep Clustering based Fair Outlier Detection☆11Oct 15, 2021Updated 4 years ago
- Implementation of semi-supervised learning: UDA, MixMatch, Mean-teacher, focusing on NLP, powered by Pytorch☆12Jan 6, 2021Updated 5 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Sep 9, 2022Updated 3 years ago
- This is pytorch version of maddpg.☆10Jun 23, 2020Updated 5 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- Code and Hummingbird dataset for EMNLP 2021 paper "Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica"☆14Apr 13, 2022Updated 3 years ago
- PyTorch implementation for "Temperature as Uncertainty in Contrastive Learning" (https://arxiv.org/abs/2110.04403).☆16Oct 19, 2021Updated 4 years ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs☆13Dec 28, 2024Updated last year
- Source code for NAACL 2022 paper Weakly Supervised Text Classification using Supervision Signals from a Language Mode☆10Jun 13, 2022Updated 3 years ago
- Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"☆16Dec 4, 2024Updated last year