lan-lc / adversarial_example_of_Go
Attack AlphaZero Go agents (NeurIPS 2022)
☆21Updated 2 years ago
Alternatives and similar repositories for adversarial_example_of_Go
Users who are interested in adversarial_example_of_Go are comparing it to the repositories listed below
- Official TensorFlow implementation of "Parsimonious Black-Box Adversarial Attacks via Efficient Combinatorial Optimization" (ICML 2019)☆40Updated 4 years ago
- Repository for Knowledge Enhanced Machine Learning Pipeline (KEMLP)☆10Updated 3 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆36Updated last year
- Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks☆38Updated 3 years ago
- Official PyTorch Implementation for Continual Learning and Private Unlearning☆14Updated 2 years ago
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…☆24Updated last year
- Guided Adversarial Attack for Evaluating and Enhancing Adversarial Defenses, NeurIPS Spotlight 2020☆27Updated 4 years ago
- ICLR 2023 paper "Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness" by Yuancheng Xu, Yanchao Sun, Micah Gold…☆25Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆29Updated last year
- [NeurIPS 2021] Fast Certified Robust Training with Short Warmup☆24Updated last year
- Code for the paper "Evading Black-box Classifiers Without Breaking Eggs" [SaTML 2024]☆20Updated last year
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆10Updated 11 months ago
- ☆54Updated 2 years ago
- On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them [NeurIPS 2020]☆36Updated 3 years ago
- Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"☆53Updated last year
- ☆20Updated 5 months ago
- ☆11Updated 3 years ago
- ☆53Updated last year
- Implementation of Confidence-Calibrated Adversarial Training (CCAT).☆45Updated 4 years ago
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Updated 2 years ago
- Code for the paper "Consistency Regularization for Certified Robustness of Smoothed Classifiers" (NeurIPS 2020)☆34Updated 4 years ago
- ☆31Updated last year
- On the effectiveness of adversarial training against common corruptions [UAI 2022]☆30Updated 3 years ago
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS, 2021).☆23Updated 2 years ago
- Code for ICML2019 Paper "On the Convergence and Robustness of Adversarial Training"☆34Updated 5 years ago
- Adversarial Distributional Training (NeurIPS 2020)☆63Updated 4 years ago
- Code for "Adversarial robustness against multiple and single $l_p$-threat models via quick fine-tuning of robust classifiers"☆18Updated 2 years ago
- kyleliang919 / Uncovering-the-Connections-BetweenAdversarial-Transferability-and-Knowledge-Transferability: Code for the ICML 2021 paper in which we explore the relationship between adversarial transferability and knowledge transferability.☆17Updated 2 years ago
- Host CIFAR-10.2 Data Set☆13Updated 3 years ago
- the paper "Geometry-aware Instance-reweighted Adversarial Training" ICLR 2021 oral☆59Updated 4 years ago