lan-lc / adversarial_example_of_GoLinks
Attack AlphaZero Go agents (NeurIPS 2022)
☆22Updated 3 years ago
Alternatives and similar repositories for adversarial_example_of_Go
Users that are interested in adversarial_example_of_Go are comparing it to the libraries listed below
Sorting:
- Official PyTorch Implementation for Continual Learning and Private Unlearning☆17Updated 3 years ago
- Repository for Knowledge Enhanced Machine Learning Pipeline (KEMLP)☆10Updated 4 years ago
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…☆25Updated 2 years ago
- ☆53Updated 2 years ago
- Official TensorFlow implementation of "Parsimonious Black-Box Adversarial Attacks via Efficient Combinatorial Optimization" (ICML 2019)☆41Updated 5 years ago
- ICLR 2023 paper "Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness" by Yuancheng Xu, Yanchao Sun, Micah Gold…☆25Updated 2 years ago
- Code relative to "Adversarial robustness against multiple and single $l_p$-threat models via quick fine-tuning of robust classifiers"☆19Updated 3 years ago
- [NeurIPS 2021] Fast Certified Robust Training with Short Warmup☆25Updated 6 months ago
- Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks☆39Updated 4 years ago
- A united toolbox for running major robustness verification approaches for DNNs. [S&P 2023]☆90Updated 2 years ago
- "Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers" (NeurIPS 2019, previously called "A Stratified Approach …☆17Updated 6 years ago
- Code for ICML2019 Paper "On the Convergence and Robustness of Adversarial Training"☆34Updated 5 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆32Updated 2 years ago
- kyleliang919 / Uncovering-the-Connections-BetweenAdversarial-Transferability-and-Knowledge-Transferabilitycode for ICML 2021 paper in which we explore the relationship between adversarial transferability and knowledge transferability.☆17Updated 3 years ago
- the paper "Geometry-aware Instance-reweighted Adversarial Training" ICLR 2021 oral☆59Updated 4 years ago
- Learning Safety Constraints for Large Language Models (ICML2025)☆25Updated 4 months ago
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS, 2021).☆24Updated 3 years ago
- The official repo for GCP-CROWN paper☆13Updated 3 years ago
- Implementation of Confidence-Calibrated Adversarial Training (CCAT).☆45Updated 5 years ago
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Updated last year
- CROWN: A Neural Network Verification Framework for Networks with General Activation Functions☆39Updated 7 years ago
- Guided Adversarial Attack for Evaluating and Enhancing Adversarial Defenses, NeurIPS Spotlight 2020☆27Updated 4 years ago
- Code for the paper "(De)Randomized Smoothing for Certifiable Defense against Patch Attacks" by Alexander Levine and Soheil Feizi.☆17Updated 3 years ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆20Updated 2 years ago
- [ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!☆38Updated last year
- Certified robustness "for free" using off-the-shelf diffusion models and classifiers☆44Updated 2 years ago
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Updated 2 years ago
- Implementation for <Understanding Robust Overftting of Adversarial Training and Beyond> in ICML'22.☆12Updated 3 years ago
- MACER: MAximizing CErtified Radius (ICLR 2020)☆30Updated 5 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆35Updated 4 years ago