lan-lc / adversarial_example_of_Go
Attack AlphaZero Go agents (NeurIPS 2022)
☆22Updated 2 years ago
Alternatives and similar repositories for adversarial_example_of_Go
Users that are interested in adversarial_example_of_Go are comparing it to the libraries listed below
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…☆25Updated 2 years ago
- Official TensorFlow implementation of "Parsimonious Black-Box Adversarial Attacks via Efficient Combinatorial Optimization" (ICML 2019)☆40Updated 4 years ago
- Official PyTorch Implementation for Continual Learning and Private Unlearning☆16Updated 3 years ago
- MACER: MAximizing CErtified Radius (ICLR 2020)☆30Updated 5 years ago
- Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks☆39Updated 4 years ago
- A united toolbox for running major robustness verification approaches for DNNs. [S&P 2023]☆90Updated 2 years ago
- Repository for Knowledge Enhanced Machine Learning Pipeline (KEMLP)☆10Updated 4 years ago
- TensorFlow implementation of Meta Adversarial Training for Adversarial Patch Attacks on Tiny ImageNet.☆26Updated 4 years ago
- Learning Safety Constraints for Large Language Models (ICML 2025)☆23Updated 3 months ago
- [NeurIPS 2021] Fast Certified Robust Training with Short Warmup☆25Updated 5 months ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆34Updated 4 years ago
- RayS: A Ray Searching Method for Hard-label Adversarial Attack (KDD 2020)☆56Updated 5 years ago
- ICLR 2023 paper "Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness" by Yuancheng Xu, Yanchao Sun, Micah Gold…☆25Updated 2 years ago
- CROWN: A Neural Network Verification Framework for Networks with General Activation Functions☆39Updated 6 years ago
- Open source implementation of the TrojDRL algorithm presented in TrojDRL: Evaluation of backdoor attacks on Deep Reinforcement Learning☆19Updated 5 years ago
- Certified Patch Robustness via Smoothed Vision Transformers☆42Updated 3 years ago
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS, 2021).☆24Updated 3 years ago
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Updated 2 years ago
- Code for the ICML 2019 paper "On the Convergence and Robustness of Adversarial Training"☆34Updated 5 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- Sparse-RS: a versatile framework for query-efficient sparse black-box adversarial attacks☆46Updated 3 years ago
- A Self-Consistent Robust Error (ICML 2022)☆69Updated 2 years ago
- Implementation for "Understanding Robust Overfitting of Adversarial Training and Beyond" in ICML'22.☆12Updated 3 years ago
- Guided Adversarial Attack for Evaluating and Enhancing Adversarial Defenses, NeurIPS Spotlight 2020☆27Updated 4 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆32Updated last year
- Code for the paper "(De)Randomized Smoothing for Certifiable Defense against Patch Attacks" by Alexander Levine and Soheil Feizi.☆17Updated 3 years ago
- The collection of papers about Private Evolution☆17Updated last month
- Code for the paper "Geometry-aware Instance-reweighted Adversarial Training" (ICLR 2021 oral)☆59Updated 4 years ago
- "Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers" (NeurIPS 2019, previously called "A Stratified Approach …☆17Updated 5 years ago