xiaoweih / AISafetyLectureNotesLinks
Machine Learning Safety
☆44, updated 3 weeks ago
Alternatives and similar repositories for AISafetyLectureNotes
Users interested in AISafetyLectureNotes are comparing it to the repositories listed below.
- Open source implementation of the TrojDRL algorithm presented in "TrojDRL: Evaluation of Backdoor Attacks on Deep Reinforcement Learning" (☆20, updated 5 years ago)
- Attack AlphaZero Go agents (NeurIPS 2022) (☆22, updated 3 years ago)
- CROWN: A Neural Network Verification Framework for Networks with General Activation Functions (☆39, updated 7 years ago)
- [S&P 2024] Replication package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets" (☆31, updated 11 months ago)
- ☆27, updated 2 years ago
- Official implementation of [USENIX Sec'25] "StruQ: Defending Against Prompt Injection with Structured Queries" (☆54, updated last month)
- [ICLR 2020] Code for the paper "Robustness Verification for Transformers" (☆27, updated last year)
- Code for reproducing the robustness evaluation scores in "Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approac…" (☆53, updated 7 years ago)
- A unified toolbox for running major robustness verification approaches for DNNs [S&P 2023] (☆90, updated 2 years ago)
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…" (☆25, updated 2 years ago)
- Adversarial attacks on Deep Reinforcement Learning (RL) (☆97, updated 4 years ago)
- PrivacyAsst: Safeguarding User Privacy in Tool-Using Large Language Model Agents (TDSC 2024) (☆17, updated last year)
- Repo for the arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers" (☆109, updated 2 years ago)
- ☆20, updated last year
- Adversarial Examples: Attacks and Defenses for Deep Learning (☆32, updated 7 years ago)
- A simple implementation of Interval Bound Propagation (IBP) using TensorFlow: https://arxiv.org/abs/1810.12715 (☆161, updated 5 years ago)
- ☆21, updated 3 years ago
- ☆19, updated last year
- Machine Learning & Security Seminar @ Purdue University (☆25, updated 2 years ago)
- ☆39, updated last year
- [ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer (☆46, updated last year)
- An LLM Can Fool Itself: A Prompt-Based Adversarial Attack (ICLR 2024) (☆107, updated 10 months ago)
- Official PyTorch implementation of the paper "X-Adv: Physical Adversarial Object Attacks against X-ray Prohibited Item Detection" (☆16, updated 2 years ago)
- Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks? (FSE 2020) (☆10, updated 4 years ago)
- ☆100, updated 5 years ago
- Code for the paper "Spinning Language Models: Risks of Propaganda-as-a-Service and Countermeasures" (☆21, updated 3 years ago)
- All code for the paper "Piecewise Linear Neural Networks Verification: A Comparative Study" (☆35, updated 7 years ago)
- Efficient robustness verification for ReLU networks (this repository is outdated; do not use it — check out the new implementation at https://g…) (☆30, updated 6 years ago)
- alpha-beta-CROWN: An Efficient, Scalable and GPU-Accelerated Neural Network Verifier (winner of VNN-COMP 2021, 2022, 2023, 2024, 2025) (☆331, updated this week)
- How Robust are Randomized Smoothing based Defenses to Data Poisoning? (CVPR 2021) (☆14, updated 4 years ago)