☆27Nov 20, 2023Updated 2 years ago
Alternatives and similar repositories for toxic-prompt
Users that are interested in toxic-prompt are comparing it to the libraries listed below
Sorting:
- RAG-based chatbot for retail e-commerce.☆31Dec 1, 2024Updated last year
- [S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models☆19Feb 18, 2025Updated last year
- https://icml.cc/virtual/2023/poster/24354☆10Aug 15, 2023Updated 2 years ago
- Insurance-RAG-Chatbot(IVA): An open-source project featuring a retrieval-augmented chatbot developed using Bedrock, LLM, LangChain, Docke…☆22May 30, 2024Updated last year
- ☆10Dec 30, 2021Updated 4 years ago
- ☆14Jul 26, 2024Updated last year
- ☆11Jan 2, 2020Updated 6 years ago
- [Preprint] On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shaping☆10Feb 27, 2020Updated 6 years ago
- codes for paper "learning to discriminate perturbations for blocking adversarial attacks in text classification" in EMNLP19☆15Feb 25, 2020Updated 6 years ago
- ☆13Oct 20, 2022Updated 3 years ago
- Deep Learning (a.k.a. Recent Trends in Machine Learning) course at dsai.asia☆18Apr 21, 2023Updated 2 years ago
- Recommend products or brands to users based on browsing history data☆13Dec 18, 2020Updated 5 years ago
- [CCS'22] SSLGuard: A Watermarking Scheme for Self-supervised Learning Pre-trained Encoders☆18Jul 12, 2022Updated 3 years ago
- Github implementation of https://reports.chatclimate.ai/☆23Jun 16, 2025Updated 8 months ago
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…☆50Dec 20, 2023Updated 2 years ago
- Code for "CloudLeak: Large-Scale Deep Learning Models Stealing Through Adversarial Examples" (NDSS 2020)☆22Nov 14, 2020Updated 5 years ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23May 8, 2023Updated 2 years ago
- ☆18Jul 1, 2021Updated 4 years ago
- PAL: Proxy-Guided Black-Box Attack on Large Language Models☆57Aug 17, 2024Updated last year
- Implementation of "Unsupervised Visual Representation Learning by Context Prediction" by C. Doersh, A. Gupta and A. A. Efros☆24Nov 18, 2021Updated 4 years ago
- ☆24Aug 18, 2023Updated 2 years ago
- ☆70Feb 4, 2024Updated 2 years ago
- ☆26Dec 1, 2022Updated 3 years ago
- ☆25Jun 23, 2021Updated 4 years ago
- Code for the paper: Label-Only Membership Inference Attacks☆68Sep 11, 2021Updated 4 years ago
- Prompting Small Language Models for Personalized Cold-Start Recommendation☆31Mar 9, 2024Updated 2 years ago
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆28Oct 31, 2022Updated 3 years ago
- TextHide: Tackling Data Privacy in Language Understanding Tasks☆31Apr 19, 2021Updated 4 years ago
- ☆25Nov 14, 2022Updated 3 years ago
- ☆35May 22, 2024Updated last year
- ☆13Feb 17, 2025Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆87Sep 12, 2024Updated last year
- A virtual caregiver system that extracts the expression of mental and physical health states through dialogue-based human-computer intera…☆14Jan 29, 2023Updated 3 years ago
- Codes for MICCAI 2021 Paper: Selective Learning from External Data for CT Image Segmentation☆12Oct 10, 2021Updated 4 years ago
- Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks (IEEE S&P 2024)☆34Jun 29, 2025Updated 8 months ago
- Factor Modeling for radiomics☆12Aug 29, 2025Updated 6 months ago
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents☆16Sep 16, 2025Updated 5 months ago
- Tool for testing IPv4 and IPv6 DHCP services☆13Mar 27, 2020Updated 5 years ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity☆11Oct 21, 2020Updated 5 years ago