Attack AlphaZero Go agents (NeurIPS 2022)
☆22Dec 3, 2022Updated 3 years ago
Alternatives and similar repositories for adversarial_example_of_Go
Users who are interested in adversarial_example_of_Go are comparing it to the repositories listed below
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆140Nov 16, 2021Updated 4 years ago
- Official Code for Scaling Adversarial Training to Large Perturbation Bounds (ECCV-2022)☆11Nov 25, 2022Updated 3 years ago
- A Benchmark Study on Machine Learning Methods for Fake News Detection☆16Jun 8, 2021Updated 4 years ago
- Accepted by CVPR 2025 (highlight)☆22Jun 8, 2025Updated 8 months ago
- This repository contains the official code for our NeurIPS 2021 publication "Robust Deep Reinforcement Learning through Adversarial Loss…☆32Jan 21, 2022Updated 4 years ago
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework☆68Jan 26, 2021Updated 5 years ago
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Apr 24, 2024Updated last year
- Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…☆23Jul 26, 2024Updated last year
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- ☆91Jan 15, 2025Updated last year
- [NeurIPS 2022] Code for paper "Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation"☆27Dec 10, 2023Updated 2 years ago
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models☆28Oct 20, 2025Updated 4 months ago
- ☆70Feb 4, 2024Updated 2 years ago
- Codes for reproducing the robustness evaluation scores in “Evaluating the Robustness of Neural Networks: An Extreme Value Theory Approac…☆52Sep 18, 2018Updated 7 years ago
- Code to generate NeuralExecs (prompt injection for LLMs)☆27Oct 5, 2025Updated 4 months ago
- The official PyTorch Implementation of the Paper "Adversarial Visual Robustness by Causal Intervention"☆18Oct 6, 2021Updated 4 years ago
- Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment (NeurIPS 2025)☆49Nov 5, 2025Updated 3 months ago
- Robustify Black-Box Models (ICLR'22 - Spotlight)☆24Jan 29, 2023Updated 3 years ago
- [NeurIPS 2021] Fast Certified Robust Training with Short Warmup☆25Jun 7, 2025Updated 8 months ago
- ☆29Oct 24, 2023Updated 2 years ago
- Code for Stability Training with Noise (STN)☆22Dec 27, 2020Updated 5 years ago
- Code to conduct an embedding attack on LLMs☆31Jan 10, 2025Updated last year
- Code for EinStein würfelt nicht! (Einstein chess)☆10Nov 24, 2020Updated 5 years ago
- [S&P 2024] Replication Package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets".☆32Dec 30, 2024Updated last year
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆27Nov 26, 2024Updated last year
- Modular Single-file Reinforcement Learning Algorithms Library☆38May 16, 2023Updated 2 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆79May 21, 2023Updated 2 years ago
- ☆12May 6, 2022Updated 3 years ago
- The code of "Hide in Thicket: Generating Imperceptible and Rational Adversarial Perturbations on 3D Point Clouds" CVPR 2024☆36Mar 23, 2024Updated last year
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 5 months ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆35Feb 22, 2021Updated 5 years ago
- Code for ICML2019 Paper "On the Convergence and Robustness of Adversarial Training"☆34Apr 28, 2020Updated 5 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- Details of a champion Game of the Amazons program☆11Jan 7, 2026Updated last month
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- 🌿 Quickly generate a folder directory structure; supports defining the directory depth and writing the output to a markdown file.☆13Oct 19, 2022Updated 3 years ago