Official Code for reproductivity of the NeurIPS 2023 paper: Adversarial Examples Are Not Real Features
☆16Jun 27, 2024Updated last year
Alternatives and similar repositories for AdvNotRealFeatures
Users that are interested in AdvNotRealFeatures are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning☆11Oct 29, 2024Updated last year
- SEAT☆21Oct 10, 2023Updated 2 years ago
- ☆21Mar 14, 2025Updated last year
- Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"☆22May 6, 2025Updated 10 months ago
- [ICCV 2023] Among Us: Adversarially Robust Collaborative Perception by Consensus☆21Feb 18, 2024Updated 2 years ago
- Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).☆16Jul 4, 2024Updated last year
- [ICLR 2023] Official repository of the paper "Rethinking the Effect of Data Augmentation in Adversarial Contrastive Learning"☆18Feb 19, 2023Updated 3 years ago
- CVPR 2023 generalist☆16Oct 25, 2023Updated 2 years ago
- ☆25May 31, 2024Updated last year
- [NeurIPS2023] Black-box Backdoor Defense via Zero-shot Image Purification☆16Oct 31, 2023Updated 2 years ago
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning☆26Nov 15, 2023Updated 2 years ago
- [BMVC 2023] Semantic Adversarial Attacks via Diffusion Models☆25Nov 30, 2023Updated 2 years ago
- [NeurIPS2021] Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks☆33Jul 5, 2024Updated last year
- ☆17Aug 17, 2021Updated 4 years ago
- Towards Defending against Adversarial Examples via Attack-Invariant Features☆12Oct 12, 2023Updated 2 years ago
- data augmentation alone can improve adversarial training☆15Mar 24, 2023Updated 2 years ago
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)☆65Jan 11, 2025Updated last year
- Official code for ICLR 2023 paper "ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond "☆35Apr 24, 2023Updated 2 years ago
- Official code for FAccT'21 paper "Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning" https://arxiv.org/abs…☆13Mar 9, 2021Updated 5 years ago
- Explore visualization tools for understanding Transformer-based large language models (LLMs)☆22Dec 1, 2024Updated last year
- Final Project for AM 207, Fall 2021. Review & experimentation with paper "Adversarial Examples Are Not Bugs, They Are Features"☆10Dec 17, 2021Updated 4 years ago
- Code for ICML2019 Paper "On the Convergence and Robustness of Adversarial Training"☆34Apr 28, 2020Updated 5 years ago
- Pytorch implementation of NPAttack☆12Jul 7, 2020Updated 5 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Code Repo for the NeurIPS 2023 paper "VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models"☆28Sep 18, 2025Updated 6 months ago
- An End-to-End Trainable Method for Generating and Detecting Fiducial Markers☆13May 29, 2021Updated 4 years ago
- Code to conduct an embedding attack on LLMs☆31Jan 10, 2025Updated last year
- Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression☆14Mar 22, 2025Updated 11 months ago
- Official Implementation of wd1☆24Sep 25, 2025Updated 5 months ago
- Code release for Intensity Harmonization for Airborne LiDAR☆11Apr 7, 2022Updated 3 years ago
- Geometric Adversarial Attacks and Defenses on 3D Point Clouds (3DV 2021)☆26Jun 25, 2023Updated 2 years ago
- AdvDiffuser: Natural Adversarial Example Synthesis with Diffusion Models (ICCV 2023)☆19Jul 22, 2023Updated 2 years ago
- ☆22Feb 24, 2026Updated 3 weeks ago
- Official repo of "Self-Supervised-3D-Data-Association"☆11Jan 20, 2021Updated 5 years ago
- [NeurIPS 2023] Code for the paper "Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization across Threa…☆39Dec 3, 2024Updated last year
- 利用简单的代码完成deepseek基于medical-o1-sft数据集的lora微调☆15Feb 25, 2025Updated last year
- Pytorch implementation of gradCAM, guidedBackProp, smoothGrad☆13Mar 5, 2019Updated 7 years ago
- Implementation for <Understanding Robust Overftting of Adversarial Training and Beyond> in ICML'22.☆13Jul 1, 2022Updated 3 years ago
- ☆12Apr 27, 2022Updated 3 years ago