SolidShen / BAIT
🔥🔥🔥 Detecting hidden backdoors in Large Language Models with only black-box access
☆15 Updated 3 months ago
Alternatives and similar repositories for BAIT:
Users who are interested in BAIT are comparing it to the repositories listed below
- ☆16 Updated 5 months ago
- Machine Learning & Security Seminar @Purdue University ☆25 Updated last year
- [NDSS'23] BEAGLE: Forensics of Deep Learning Backdoor Attack for Better Defense ☆17 Updated 9 months ago
- ☆14 Updated 2 years ago
- [IEEE S&P'24] ODSCAN: Backdoor Scanning for Object Detection Models ☆13 Updated last month
- Distribution Preserving Backdoor Attack in Self-supervised Learning ☆14 Updated last year
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur… ☆11 Updated 2 years ago
- Seminar 2022 ☆22 Updated 2 weeks ago
- Official repository for CVPR'23 paper: Detecting Backdoors in Pre-trained Encoders ☆31 Updated last year
- ☆24 Updated 4 months ago
- Code release for DeepJudge (S&P'22) ☆50 Updated last year
- ☆18 Updated 11 months ago
- Siren: Byzantine-robust Federated Learning via Proactive Alarming (SoCC '21) ☆11 Updated 10 months ago
- Code repository for the paper --- [USENIX Security 2023] Towards A Proactive ML Approach for Detecting Backdoor Poison Samples ☆25 Updated last year
- ☆14 Updated last year
- ☆22 Updated 4 months ago
- ☆20 Updated 5 months ago
- [IEEE S&P 2024] Exploring the Orthogonality and Linearity of Backdoor Attacks ☆21 Updated last month
- ☆24 Updated 3 years ago
- ☆10 Updated 3 years ago
- ☆26 Updated 2 years ago
- Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking" ☆15 Updated 6 months ago
- ☆64 Updated 4 years ago
- This is the implementation for CVPR 2022 Oral paper "Better Trigger Inversion Optimization in Backdoor Scanning." ☆24 Updated 2 years ago
- Learning Security Classifiers with Verified Global Robustness Properties (CCS'21) https://arxiv.org/pdf/2105.11363.pdf ☆27 Updated 3 years ago
- ☆79 Updated 3 years ago
- ☆18 Updated 6 months ago
- ☆17 Updated 2 years ago