☆50Aug 3, 2024Updated last year
Alternatives and similar repositories for redteaming-resistance-benchmark
Users that are interested in redteaming-resistance-benchmark are comparing it to the libraries listed below
Sorting:
- LLM red teaming datasets from the paper 'Student-Teacher Prompting for Red Teaming to Improve Guardrails' for the ART of Safety Workshop …☆22Oct 12, 2023Updated 2 years ago
- This repo contains a demo of adversarial strings poisoning vector database and forching specific hallucinations on RAG chatbot.☆10May 2, 2024Updated last year
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆37Feb 22, 2025Updated last year
- autoredteam: code for training models that automatically red team other language models☆15Aug 9, 2023Updated 2 years ago
- Python standalone tokenizer☆15Nov 12, 2015Updated 10 years ago
- NVIDIA’s repository for enabling trustworthy AI.☆27Updated this week
- Automated Safety Testing of Large Language Models☆18Jan 31, 2025Updated last year
- ☆16May 30, 2024Updated last year
- Netflix for XBMC☆61Nov 13, 2012Updated 13 years ago
- LLM evaluation.☆16Nov 7, 2023Updated 2 years ago
- Offical Repository of MetaAgent Program☆41Dec 2, 2025Updated 3 months ago
- ☆48May 9, 2024Updated last year
- Implementation of BEAST adversarial attack for language models (ICML 2024)☆90May 14, 2024Updated last year
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Aug 3, 2024Updated last year
- ☆27Jul 20, 2024Updated last year
- Tree of Attacks (TAP) Jailbreaking Implementation☆118Feb 7, 2024Updated 2 years ago
- Qelm - Quantum Enhanced Language Model☆25Mar 3, 2026Updated last week
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment☆108Mar 8, 2024Updated 2 years ago
- A simple trick to get DataCamp Subscription free for 2 Months and using Data Camp Course Scrapper to grab all videos of a particular cour…☆13Jun 4, 2020Updated 5 years ago
- A library for red-teaming LLM applications with LLMs.☆29Oct 11, 2024Updated last year
- The Oyster series is a set of safety models developed in-house by Alibaba-AAIG, devoted to building a responsible AI ecosystem. | Oyster …☆59Sep 11, 2025Updated 5 months ago
- Learn How To Observe, Manage, and Scale, Agentic AI Apps Using Azure AI Foundry - with this hands-on workshop☆39Feb 5, 2026Updated last month
- A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.☆106Apr 15, 2024Updated last year
- Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!☆351Oct 17, 2025Updated 4 months ago
- Code to reproduce experiments from the EMNLP 2015 paper about Rumour Stance Classification with Gaussian Processes.☆37May 23, 2016Updated 9 years ago
- [AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts☆192Jun 26, 2025Updated 8 months ago
- LLM Self Defense: By Self Examination, LLMs know they are being tricked☆51May 21, 2024Updated last year
- 【ACL 2024】 SALAD benchmark & MD-Judge☆171Mar 8, 2025Updated last year
- Quantum computation is now encroaching in every field of science. One such use of quantum computation is shown here in a field of machine…☆10Aug 24, 2019Updated 6 years ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- KeepGPU is a simple CLI app that keeps your GPUs running.☆22Updated this week
- ☆12Jan 11, 2026Updated last month
- Public website for CodeX Academy.☆12Jan 27, 2023Updated 3 years ago
- The Matlab/Octave code for our paper "Towards fast embedded moving horizon state-of-charge estimation for lithium-ion batteries"☆12May 21, 2024Updated last year
- DOMIAS, a density-based MIA model that aims to infer membership by targeting local overfitting of the generative model.☆12May 29, 2023Updated 2 years ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- Lock circuitgraphs using various logic locking techniques☆11May 2, 2023Updated 2 years ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year