xiaoweih / AISafetyLectureNotesLinks
Machine Learning Safety
☆42Updated this week
Alternatives and similar repositories for AISafetyLectureNotes
Users that are interested in AISafetyLectureNotes are comparing it to the libraries listed below
Sorting:
- Open source implementation of the TrojDRL algorithm presented in TrojDRL: Evaluation of backdoor attacks on Deep Reinforcement Learning☆19Updated 5 years ago
- Attack AlphaZero Go agents (NeurIPS 2022)☆22Updated 2 years ago
- ☆21Updated 3 years ago
- ☆27Updated 2 years ago
- [S&P 2024] Replication Package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets".☆30Updated 10 months ago
- alpha-beta-CROWN: An Efficient, Scalable and GPU Accelerated Neural Network Verifier (winner of VNN-COMP 2021, 2022, 2023, 2024, 2025)☆319Updated 9 months ago
- A united toolbox for running major robustness verification approaches for DNNs. [S&P 2023]☆90Updated 2 years ago
- Adversarial attacks on Deep Reinforcement Learning (RL)☆97Updated 4 years ago
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…☆25Updated 2 years ago
- Adversarial Example Attacks on Policy Learners☆40Updated 5 years ago
- Codes for reproducing the robustness evaluation scores in “Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approac…☆53Updated 7 years ago
- ☆17Updated 2 years ago
- Code for ICML2019 Paper "On the Convergence and Robustness of Adversarial Training"☆34Updated 5 years ago
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆27Updated 11 months ago
- CROWN: A Neural Network Verification Framework for Networks with General Activation Functions☆38Updated 6 years ago
- auto_LiRPA: An Automatic Linear Relaxation based Perturbation Analysis Library for Neural Networks and General Computational Graphs☆329Updated this week
- Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"☆109Updated 2 years ago
- This repo keeps track of popular provable training and verification approaches towards robust neural networks, including leaderboards on …☆98Updated 3 years ago
- ☆22Updated 4 months ago
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆137Updated 3 years ago
- This repository contains a simple implementation of Interval Bound Propagation (IBP) using TensorFlow: https://arxiv.org/abs/1810.12715☆162Updated 5 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆34Updated 4 years ago
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework☆68Updated 4 years ago
- ☆29Updated 4 years ago
- Neural Network Verification Software Tool☆133Updated 2 weeks ago
- VNN Neural Network Verification Competition 2021☆36Updated 4 years ago
- Library containing PyTorch implementations of various adversarial attacks and resources☆164Updated last month
- A Simulated Optimal Intrusion Response Game☆21Updated 3 years ago
- official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries☆48Updated 3 months ago
- OVAL framework for BaB-based Neural Network Verification☆18Updated last month