annjawn / llm-safety-privacy
Safety and privacy with LLMs
☆14 Sep 25, 2023 Updated 2 years ago
Alternatives and similar repositories for llm-safety-privacy
Users who are interested in llm-safety-privacy are comparing it to the libraries listed below
- A multi-purpose LLM framework for RAG and data creation. ☆629 Jan 13, 2024 Updated 2 years ago
- ☆11 Dec 23, 2024 Updated last year
- BrainWash: A Poisoning Attack to Forget in Continual Learning ☆12 Apr 15, 2024 Updated last year
- On the Robustness of GUI Grounding Models Against Image Attacks ☆12 Apr 8, 2025 Updated 10 months ago
- Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT. ☆12 May 22, 2023 Updated 2 years ago
- Code for AISTATS'25 paper - On the Power of Adaptive Weighted Aggregation in Heterogeneous Federated Learning and Beyond ☆13 Sep 23, 2025 Updated 4 months ago
- This is a project based on opencv-python which estimates the height of an object from its picture. It uses the height reference of a … ☆10 Dec 11, 2020 Updated 5 years ago
- ☆13 Mar 9, 2025 Updated 11 months ago
- ☆19 Jul 21, 2025 Updated 6 months ago
- SemBleu: A Robust Metric for AMR Parsing Evaluation ☆12 Feb 22, 2021 Updated 4 years ago
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs" ☆14 Dec 16, 2024 Updated last year
- Code for Rethinking Prompt Optimizers: From Prompt Merits to Optimization ☆12 Jan 12, 2026 Updated last month
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery ☆20 Sep 24, 2025 Updated 4 months ago
- ☆12 Jun 15, 2024 Updated last year
- ☆10 Jun 12, 2019 Updated 6 years ago
- code for unsupervised entity resolution ☆10 Apr 26, 2019 Updated 6 years ago
- [ICLR 2022] Boosting Randomized Smoothing with Variance Reduced Classifiers ☆12 Mar 29, 2022 Updated 3 years ago
- A research workbench for developing and testing attacks against large language models, with a focus on prompt injection vulnerabilities a… ☆38 Updated this week
- ☆14 Feb 26, 2025 Updated 11 months ago
- Bundle of security analysis scripts for keras tensorflow models ☆15 Apr 15, 2024 Updated last year
- ☆11 Oct 9, 2023 Updated 2 years ago
- PRSA: Prompt Stealing Attacks against Real-World Prompt Services (USENIX Security '25) ☆24 Dec 25, 2025 Updated last month
- ☆10 Jun 29, 2020 Updated 5 years ago
- Face recognition with loss of softmax, sphereface, cosface, arcface in pytorch of python3 ☆10 Apr 27, 2020 Updated 5 years ago
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks ☆19 Sep 18, 2025 Updated 4 months ago
- Unofficial Iranian hackers group disk wiper malware aka "Shamoon" in .NET 2.0 ☆13 Dec 23, 2018 Updated 7 years ago
- ☆11 Jul 5, 2023 Updated 2 years ago
- ☆12 Jul 3, 2023 Updated 2 years ago
- Code for the CVPR '23 paper, "Defending Against Patch-based Backdoor Attacks on Self-Supervised Learning" ☆10 Jun 9, 2023 Updated 2 years ago
- ☆13 Sep 1, 2025 Updated 5 months ago
- [Preprint] Backdoor Attacks on Federated Learning with Lottery Ticket Hypothesis ☆10 Sep 23, 2021 Updated 4 years ago
- Code for Spectral Norm of Convolutional Layers with Circular and Zero Paddings and Efficient Bound of Lipschitz Constant for Convolutiona… ☆15 Feb 2, 2024 Updated 2 years ago
- ☆14 Jan 26, 2025 Updated last year
- [NDSS'25] The official implementation of safety misalignment. ☆17 Jan 8, 2025 Updated last year
- ☆13 Jul 17, 2024 Updated last year
- Code and full version of the paper "Hijacking Attacks against Neural Network by Analyzing Training Data" ☆14 Feb 28, 2024 Updated last year
- AI Security Research ☆15 Jun 21, 2023 Updated 2 years ago
- Instructions for requesting access to AdvDroidZero ☆13 Apr 10, 2024 Updated last year
- Code repo for the UAI 2023 paper "Learning To Invert: Simple Adaptive Attacks for Gradient Inversion in Federated Learning". ☆16 Jun 15, 2024 Updated last year