NayMyatMin/CROW

Internal Consistency Regularization (CROW) for LLM Backdoor Elimination - Paper accepted to ICML 2025

☆16

Alternatives and similar repositories for CROW

Users who are interested in CROW are comparing it to the repositories listed below.

lancopku / RAP
Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)
☆25 · Oct 21, 2021 · Updated 4 years ago
gao-xiao-bai / JsonTuning
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
☆10 · Nov 3, 2024 · Updated last year
clearloveclearlove / BEAT
☆15 · Feb 26, 2025 · Updated last year
dgl-prc / m_testing_adversatial_sample
☆26 · May 27, 2020 · Updated 6 years ago
real-absolute-AI / Unnatural_Language
The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'
☆24 · May 20, 2025 · Updated last year
OSU-NLP-Group / AgentAttack
☆22 · Oct 25, 2024 · Updated last year
ethz-spylab / autoadvexbench
☆42 · May 21, 2025 · Updated last year
carriex / lfqa_eval
ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"
☆21 · Mar 22, 2024 · Updated 2 years ago
baixianghuang / editing-attack
Code and dataset for the paper: "Can Editing LLMs Inject Harm?" [AAAI'26]
☆21 · Dec 26, 2025 · Updated 6 months ago
Di-viner / LLM-Robustness-to-Irrelevant-Information
[COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?
☆23 · Oct 13, 2024 · Updated last year
Yoruko-Tang / ModelGuard
Official implementation of the USENIX Security 2024 paper ModelGuard: Information-Theoretic Defense Against Model Extraction Attacks.
☆25 · Dec 6, 2023 · Updated 2 years ago
m3yrin / aligned-cross-entropy
Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655
☆21 · Jul 25, 2024 · Updated last year
LucasFenaux / PILLAR-ESPN
Code for the paper: Fast and Private Inference of Deep Neural Networks by Co-designing Activation Functions
☆12 · Mar 13, 2024 · Updated 2 years ago
AmbitYuki / RoFed-LLM
Robust Federated Learning for Large Language Models in Adversarial Wireless Environments
☆16 · Mar 7, 2025 · Updated last year
khoaguin / HESplitNet
Two-party Privacy-preserving Neural Network Training using Split Learning and Homomorphic Encryption (CKKS Scheme)
☆12 · Sep 23, 2025 · Updated 9 months ago
Tianshi-Xu / PrivCirNet
[NeurIPS'24] Official implementation of "PrivCirNet: Efficient Private Inference via Block Circulant Transformation"
☆14 · Feb 26, 2026 · Updated 4 months ago
seth-lu / Im2win
☆14 · May 23, 2023 · Updated 3 years ago
mchen725 / DD_IGD
[ICLR 2025] Official repository for the paper "Influence-Guided Diffusion for Dataset Distillation".
☆15 · Feb 12, 2025 · Updated last year
MattYoon / reasoning-models-confidence
[NeurIPS 2025] "Reasoning Models Better Express Their Confidence"
☆23 · Nov 19, 2025 · Updated 8 months ago
PKU-SEC-Lab / mpcvit
Code release for MPCViT accepted by ICCV 2023
☆16 · Jan 6, 2025 · Updated last year
radhika1601 / ScalableMixedModeMPC
Implementation for the protocols described in https://eprint.iacr.org/2023/1700
☆14 · Apr 29, 2026 · Updated 2 months ago
Yunhao-Feng / AgentHazard
☆28 · Jun 13, 2026 · Updated last month
human-analysis / AutoFHE
Official implementation for AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE. The paper is presented at the 33rd USE…
☆34 · Nov 24, 2025 · Updated 7 months ago
AI-secure / FedGame
Official implementation for paper "FedGame: A Game-Theoretic Defense against Backdoor Attacks in Federated Learning" (NeurIPS 2023).
☆13 · Oct 25, 2024 · Updated last year
Sandy-Zeng / NPAttack
PyTorch implementation of NPAttack
☆12 · Jul 7, 2020 · Updated 6 years ago
wanghangpsu / MM-BD
The implementation of the IEEE S&P 2024 paper MM-BD: Post-Training Detection of Backdoor Attacks with Arbitrary Backdoor Pattern Types Us…
☆16 · May 12, 2024 · Updated 2 years ago
KaiyuanZh / CENSOR
[NDSS 2025] CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling
☆19 · Jan 18, 2025 · Updated last year
AI-secure / Knowledge-Enhanced-Machine-Learning-Pipeline
Repository for Knowledge Enhanced Machine Learning Pipeline (KEMLP)
☆10 · Jun 5, 2021 · Updated 5 years ago
NLP2CT / ConsistTL
Implementation of our paper in EMNLP 2022, focused on the relationship between parent and child in transfer learning for low-resourc…
☆17 · Dec 7, 2022 · Updated 3 years ago
Jinxhy / AppAIsecurity
[ICSE-SEIP'21] Robustness of on-device Models: Adversarial Attack to Deep Learning Models on Android Apps
☆15 · Jun 2, 2022 · Updated 4 years ago
liuzrcc / ImageShortcutSqueezing
Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression
☆14 · Mar 22, 2025 · Updated last year
UM-Data-Intelligence-Lab / NYLON_code
☆20 · Feb 18, 2024 · Updated 2 years ago
snu-ccl / approxCNN
☆17 · Feb 3, 2022 · Updated 4 years ago
UM-Data-Intelligence-Lab / HELIOS_code
☆20 · Oct 29, 2023 · Updated 2 years ago
albert-y1n / PISmith
PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses
☆21 · Updated this week
Jayfeather1024 / Backdoor-Enhanced-Alignment
☆24 · Dec 8, 2024 · Updated last year
PurduePAML / DBS
☆18 · Aug 15, 2022 · Updated 3 years ago
sokcertifiedrobustness / VeriGauge-deprecated
☆11 · Oct 18, 2022 · Updated 3 years ago
wenzhifang / Federated-Sketching-LoRA-Implementation
☆28 · May 21, 2025 · Updated last year