lan-lc / adversarial_example_of_Go
Attack AlphaZero Go agents (NeurIPS 2022)
☆20Updated 2 years ago
Alternatives and similar repositories for adversarial_example_of_Go:
Users that are interested in adversarial_example_of_Go are comparing it to the libraries listed below
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS, 2021).☆23Updated 2 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆29Updated last year
- "Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers" (NeurIPS 2019, previously called "A Stratified Approach …☆17Updated 5 years ago
- Official PyTorch Implementation for Continual Learning and Private Unlearning☆14Updated 2 years ago
- [NeurIPS 2021] Fast Certified Robust Training with Short Warmup☆23Updated last year
- Rewarded soups official implementation☆56Updated last year
- Official TensorFlow implementation of "Parsimonious Black-Box Adversarial Attacks via Efficient Combinatorial Optimization" (ICML 2019)☆40Updated 4 years ago
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…☆24Updated last year
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆33Updated 4 years ago
- ☆11Updated 3 years ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022]☆30Updated 2 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- Code for Stability Training with Noise (STN)☆21Updated 4 years ago
- Code for the paper "Consistency Regularization for Certified Robustness of Smoothed Classifiers" (NeurIPS 2020)☆35Updated 4 years ago
- ☆53Updated 2 years ago
- Implementation of Confidence-Calibrated Adversarial Training (CCAT).☆45Updated 4 years ago
- Benchmark for LP-relaxed robustness verification of ReLU-networks☆41Updated 5 years ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆18Updated last year
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework☆63Updated 4 years ago
- Reference implementations for RecurJac, CROWN, FastLin and FastLip (Neural Network verification and robustness certification algorithms)…☆26Updated 5 years ago
- [ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!☆34Updated 7 months ago
- Code and data for the ICLR 2021 paper "Perceptual Adversarial Robustness: Defense Against Unseen Threat Models".☆55Updated 3 years ago
- ☆43Updated last year
- Host CIFAR-10.2 Data Set☆13Updated 3 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆32Updated last year
- Certified Patch Robustness via Smoothed Vision Transformers☆42Updated 3 years ago
- ICLR 2023 paper "Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness" by Yuancheng Xu, Yanchao Sun, Micah Gold…☆24Updated last year
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆9Updated last year
- Efficient Robustness Verification for ReLU networks (this repository is outdated, don't use; checkout our new implementation at https://g…☆30Updated 5 years ago
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆127Updated 3 years ago