π₯π₯π₯ Detecting hidden backdoors in Large Language Models with only black-box access
β55Jun 2, 2025Updated 10 months ago
Alternatives and similar repositories for BAIT
Users that are interested in BAIT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [IEEE S&P'24] ODSCAN: Backdoor Scanning for Object Detection Modelsβ22Oct 5, 2025Updated 6 months ago
- β16Sep 4, 2024Updated last year
- β16Dec 29, 2023Updated 2 years ago
- Siren: Byzantine-robust Federated Learning via Proactive Alarming (SoCC '21)β11Mar 28, 2024Updated 2 years ago
- β18Aug 15, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- β20Feb 11, 2024Updated 2 years ago
- [NDSS'23] BEAGLE: Forensics of Deep Learning Backdoor Attack for Better Defenseβ17May 7, 2024Updated last year
- Official Implementation of NeurIPS 2024 paper - BiScope: AI-generated Text Detection by Checking Memorization of Preceding Tokensβ29Feb 17, 2026Updated 2 months ago
- Distribution Preserving Backdoor Attack in Self-supervised Learningβ20Jan 27, 2024Updated 2 years ago
- β14Feb 26, 2025Updated last year
- β26Dec 1, 2022Updated 3 years ago
- [NeurIPS 2025] BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Modelsβ297Mar 13, 2026Updated last month
- Implement of Implicit Knowledge Extraction Attack.β22Apr 17, 2026Updated 2 weeks ago
- Backdooring Neural Code Searchβ14Sep 8, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [AAAI'21] Deep Feature Space Trojan Attack of Neural Networks by Controlled Detoxificationβ30Dec 31, 2024Updated last year
- [NDSS 2025] "CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models"β26Aug 20, 2025Updated 8 months ago
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacksβ20Sep 18, 2025Updated 7 months ago
- Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107β21Aug 10, 2024Updated last year
- Official repository for CVPR'23 paper: Detecting Backdoors in Pre-trained Encodersβ38Sep 25, 2023Updated 2 years ago
- β18Jun 15, 2021Updated 4 years ago
- [ICLR24] Official Repo of BadChain: Backdoor Chain-of-Thought Prompting for Large Language Modelsβ53Jul 24, 2024Updated last year
- β27Aug 28, 2024Updated last year
- Implementation of "Physical Attack on Monocular Depth Estimation with Optimal Adversarial Patches"β25Aug 31, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Example TrojAI Submissionβ27Dec 6, 2024Updated last year
- Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking"β19Mar 10, 2025Updated last year
- Nyx: Detecting Exploitable Front-Running Vulnerabilities in Smart Contractsβ23May 11, 2024Updated last year
- Code for AAAI 2021 "Towards Feature Space Adversarial Attack".β30Aug 24, 2021Updated 4 years ago
- β13May 1, 2024Updated 2 years ago
- β19Mar 9, 2024Updated 2 years ago
- Code for NDSS 2022 paper "MIRROR: Model Inversion for Deep Learning Network with High Fidelity"β27May 9, 2023Updated 2 years ago
- Code for paper "The Philosopherβs Stone: Trojaning Plugins of Large Language Models"β29Sep 11, 2024Updated last year
- [ICLR 2023, Best Paper Award at ECCVβ22 AROW Workshop] FLIP: A Provable Defense Framework for Backdoor Mitigation in Federated Learningβ60Dec 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β37Oct 17, 2024Updated last year
- β19Feb 25, 2024Updated 2 years ago
- β16May 23, 2024Updated last year
- [ECCV'24] UNIT: Backdoor Mitigation via Automated Neural Distribution Tighteningβ10Dec 18, 2025Updated 4 months ago
- [NDSS 2025] CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Samplingβ17Jan 18, 2025Updated last year
- β23Jan 5, 2026Updated 3 months ago
- This is the implementation for CVPR 2022 Oral paper "Better Trigger Inversion Optimization in Backdoor Scanning."β24Apr 5, 2022Updated 4 years ago