About: Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)
☆15Jul 9, 2023Updated 2 years ago
Alternatives and similar repositories for Bayesian-Red-Teaming
Users that are interested in Bayesian-Red-Teaming are comparing it to the libraries listed below.
- Official PyTorch implementation of "Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian O…☆26Sep 26, 2023Updated 2 years ago
- ☆15Nov 22, 2023Updated 2 years ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- Official code and dataset repository of KoBBQ (TACL 2024)☆19May 13, 2024Updated 2 years ago
- ☆31Feb 8, 2026Updated 3 months ago
- ☆10Aug 22, 2023Updated 2 years ago
- Official Pytorch Implementation of Unsupervised Representation Learning for Binary Networks by Joint Classifier Training (CVPR 2022)☆11Apr 10, 2022Updated 4 years ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 3 years ago
- Test LLMs against jailbreaks and unprecedented harms☆39Oct 19, 2024Updated last year
- Emoji Attack [ICML 2025]☆44Jul 15, 2025Updated 10 months ago
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆32Jan 3, 2026Updated 4 months ago
- Revisiting Character-level Adversarial Attacks for Language Models, ICML 2024☆19Feb 12, 2025Updated last year
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆15Apr 23, 2025Updated last year
- ☆17Dec 8, 2023Updated 2 years ago
- ☆24Dec 2, 2023Updated 2 years ago
- Benchmark for Biophysical Sequence Optimization Algorithms☆22Apr 15, 2026Updated last month
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- ☆60Mar 9, 2023Updated 3 years ago
- Code for "Multi-Objective GFlowNets"☆20Jul 12, 2023Updated 2 years ago
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Apr 24, 2024Updated 2 years ago
- GenRM-CoT: Data release for verification rationales☆68Oct 16, 2024Updated last year
- A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack–Defense Evaluation☆68May 8, 2026Updated 2 weeks ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆72Nov 1, 2022Updated 3 years ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23May 8, 2023Updated 3 years ago
- The official source code for TaleBrush (CHI 2022)☆15Jul 13, 2022Updated 3 years ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Oct 11, 2024Updated last year
- The official repository of the paper "On the Exploitability of Instruction Tuning".☆69Feb 5, 2024Updated 2 years ago
- ☆18Jun 4, 2025Updated 11 months ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)☆250Jun 29, 2023Updated 2 years ago
- ☆14May 9, 2024Updated 2 years ago
- Code and data for the ACM CIKM 2024 paper "Adversarial Text Rewriting for Text-aware Recommender Systems"☆12Aug 1, 2024Updated last year
- [TOIS'24] "RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation"☆16Dec 1, 2024Updated last year
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean☆48Dec 23, 2024Updated last year
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆35Oct 2, 2025Updated 7 months ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- Explore, Establish, Exploit: Red Teaming Language Models from Scratch☆15Jun 21, 2023Updated 2 years ago
- Tensorflow implementation of TrialAttack (Triple Adversarial Learning for Influence based Poisoning Attack in Recommender Systems. KDD 20…☆12Sep 2, 2021Updated 4 years ago
- Official repository for "Embodied Agents Meet Personalization: Investigating Challenges and Solutions Through the Lens of Memory Utilizat…☆22Oct 24, 2025Updated 7 months ago
- ☆21Mar 17, 2025Updated last year