rmin2000/adv_tracing

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rmin2000/adv_tracing)

rmin2000 / adv_tracing

Identification of the Adversary from a Single Adversarial Example (ICML 2023)

☆10

Alternatives and similar repositories for adv_tracing

Users that are interested in adv_tracing are comparing it to the libraries listed below

Sorting:

AISafety-HKUST / Backdoor_Safety_Tuning
View on GitHub
Backdoor Safety Tuning (NeurIPS 2023 & 2024 Spotlight)
☆27Nov 18, 2024Updated last year
Alan-Qin / Transfer_attack_RAP
View on GitHub
Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation (NeurIPS 2022)
☆33Dec 16, 2022Updated 3 years ago
qizhangli / MoreBayesian-attack
View on GitHub
Code for our ICLR 2023 paper Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples.
☆18May 31, 2023Updated 2 years ago
git-disl / Vaccine
View on GitHub
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
☆49Jan 15, 2026Updated last month
ash-aldujaili / blackbox-adv-examples-signhunter
View on GitHub
A repository for the query-efficient black-box attack, SignHunter
☆23Jan 15, 2020Updated 6 years ago
context-machine-lab / ContextAgent
View on GitHub
Context-central multi-agent framework with PyTorch-like API. Build intelligent agent systems with minimal code.
☆73Oct 26, 2025Updated 4 months ago
rkteddy / channel-Lipschitzness-based-pruning
View on GitHub
Source code for ECCV 2022 Poster: Data-free Backdoor Removal based on Channel Lipschitzness
☆35Jan 9, 2023Updated 3 years ago
thestephencasper / benchmarking_interpretability
View on GitHub
☆35Sep 13, 2023Updated 2 years ago
ffhibnese / CGNC_Targeted_Adversarial_Attacks
View on GitHub
[ECCV-2024] Transferable Targeted Adversarial Attack, CLIP models, Generative adversarial network, Multi-target attacks
☆38Apr 23, 2025Updated 10 months ago
milesaturpin / cot-unfaithfulness
View on GitHub
☆52Oct 23, 2023Updated 2 years ago
liuchen11 / AdversaryLossLandscape
View on GitHub
On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them [NeurIPS 2020]
☆36Jul 3, 2021Updated 4 years ago
THU-KEG / PairJudgeRM
View on GitHub
☆14Apr 14, 2025Updated 10 months ago
JiayuJeff / CostBench
View on GitHub
The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments…
☆27Dec 10, 2025Updated 2 months ago
dieuroi / SimAesthetics
View on GitHub
☆10May 18, 2024Updated last year
measure-infinity / mulan-code
View on GitHub
☆41Jul 16, 2024Updated last year
aengusl / latent-adversarial-training
View on GitHub
☆48Sep 29, 2024Updated last year
tanganke / subspace_fusion
View on GitHub
Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"
☆14Mar 28, 2024Updated last year
fabienbaradel / Tensorflow-tutorials
View on GitHub
Seminar: intro to deep learning with tensorflow
☆13Jun 27, 2017Updated 8 years ago
Jinxiaolong1129 / Foot-in-the-door-Jailbreak
View on GitHub
☆19May 14, 2025Updated 9 months ago
ZiangYan / subspace-attack.pytorch
View on GitHub
Implementation of our NeurIPS 2019 paper: Subspace Attack: Exploiting Promising Subspaces for Query-Efficient Black-box Attacks
☆10Dec 16, 2019Updated 6 years ago
TheoGuyard / El0ps
View on GitHub
El0ps: An Exact L0-Problem Solver
☆13Jan 6, 2026Updated last month
SCIR-SC-Qiaoban-Team / FreeEvalLM
View on GitHub
[AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…
☆10Feb 7, 2026Updated 3 weeks ago
w-yibo / R1-Compress
View on GitHub
[NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
☆17Jan 24, 2026Updated last month
YihanWang617 / On-ell_p-Robustness-of-Ensemble-Stumps-and-Trees
View on GitHub
Code of On L-p Robustness of Decision Stumps and Trees, ICML 2020
☆10Aug 3, 2020Updated 5 years ago
yuxiang-gao / awesome-llm-blogs
View on GitHub
Blogs that I'm actively following.
☆13Sep 17, 2023Updated 2 years ago
inspire-group / OOD-Attacks
View on GitHub
Attacks using out-of-distribution adversarial examples
☆11Nov 19, 2019Updated 6 years ago
IBM / Adversarial-Prompt-Evaluation
View on GitHub
Code Implementation of Adversarial Prompt Evaluation paper
☆14Sep 18, 2025Updated 5 months ago
clearloveclearlove / BEAT
View on GitHub
☆14Feb 26, 2025Updated last year
zjunlp / LookAheadTuning
View on GitHub
[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews
☆17Dec 14, 2025Updated 2 months ago
zifanw / boundary
View on GitHub
Implementation of Boundary Attributions for Normal (Vector) Explanations
☆11Aug 13, 2021Updated 4 years ago
EnnengYang / RepresentationSurgery
View on GitHub
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆47Oct 10, 2024Updated last year
reds-lab / Universal_Pert_Cert
View on GitHub
This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…
☆12Feb 14, 2023Updated 3 years ago
princeton-polaris-lab / Evaluating-Durable-Safeguards
View on GitHub
[ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs
☆13Jun 20, 2025Updated 8 months ago
Li-Hyn / LLM_CatastrophicForgetting
View on GitHub
Code for LLM_Catastrophic_Forgetting via SAM.
☆11Jun 7, 2024Updated last year
dayu11 / Availability-Attacks-Create-Shortcuts
View on GitHub
☆10Jul 28, 2022Updated 3 years ago
dangne / tmd
View on GitHub
[EMNLP'22] Textual Manifold-based Defense Against Natural Language Adversarial Examples
☆11Apr 6, 2023Updated 2 years ago
IBM / NeuralFuse
View on GitHub
[NeurIPS'24] "NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage Regimes"
☆10Sep 18, 2025Updated 5 months ago
lamda-bbo / mcts-transfer
View on GitHub
Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".
☆12Nov 28, 2024Updated last year
EmpathYang / ADEPT
View on GitHub
Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).
☆15Dec 13, 2024Updated last year