lan-lc / adversarial_example_of_Go
Attack AlphaZero Go agents (NeurIPS 2022)
☆20Updated 2 years ago
Alternatives and similar repositories for adversarial_example_of_Go:
Users that are interested in adversarial_example_of_Go are comparing it to the libraries listed below
- [NeurIPS 2021] Fast Certified Robust Training with Short Warmup☆23Updated last year
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…☆24Updated last year
- "Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers" (NeurIPS 2019, previously called "A Stratified Approach …☆17Updated 5 years ago
- Rewarded soups official implementation☆55Updated last year
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS, 2021).☆23Updated 2 years ago
- ☆11Updated 3 years ago
- Understanding Rare Spurious Correlations in Neural Network☆12Updated 2 years ago
- Official PyTorch Implementation for Continual Learning and Private Unlearning☆13Updated 2 years ago
- [ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!☆33Updated 6 months ago
- Code and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"☆23Updated 2 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆34Updated 11 months ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆17Updated last year
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Updated 2 years ago
- Guided Adversarial Attack for Evaluating and Enhancing Adversarial Defenses, NeurIPS Spotlight 2020☆26Updated 4 years ago
- ☆53Updated last year
- Code for "BayesAdapter: Being Bayesian, Inexpensively and Robustly, via Bayeisan Fine-tuning"☆31Updated 6 months ago
- CROWN: A Neural Network Verification Framework for Networks with General Activation Functions☆38Updated 6 years ago
- ICLR 2023 paper "Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness" by Yuancheng Xu, Yanchao Sun, Micah Gold…☆24Updated last year
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆27Updated 2 months ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022]☆30Updated 2 years ago
- Code for the paper "MMA Training: Direct Input Space Margin Maximization through Adversarial Training"☆34Updated 4 years ago
- On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them [NeurIPS 2020]☆35Updated 3 years ago
- ☆13Updated 3 months ago
- Code relative to "Adversarial robustness against multiple and single $l_p$-threat models via quick fine-tuning of robust classifiers"☆18Updated 2 years ago
- Benchmark for LP-relaxed robustness verification of ReLU-networks☆41Updated 5 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆28Updated last year
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆10Updated 8 months ago
- Code for Stability Training with Noise (STN)☆21Updated 4 years ago
- Repository for Knowledge Enhanced Machine Learning Pipeline (KEMLP)☆10Updated 3 years ago