TrustAIRLab/HateBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TrustAIRLab/HateBench)

TrustAIRLab / HateBench

[USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns

☆15

Alternatives and similar repositories for HateBench

Users that are interested in HateBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cispa / mwait
View on GitHub
Proof-of-concept implementation for the paper "(M)WAIT for It: Bridging the Gap between Microarchitectural and Architectural Side Channel…
☆29Nov 30, 2023Updated 2 years ago
cispa / ShadowLoad
View on GitHub
☆14Apr 1, 2025Updated last year
seadog007 / smartcontract_ctfgame
View on GitHub
The CTF questions about smart contracts
☆11Sep 1, 2018Updated 7 years ago
cispa / regcheck
View on GitHub
Proof-of-concept implementation for the paper "Reviving Meltdown 3a" (ESORICS 2023)
☆17Sep 25, 2023Updated 2 years ago
CosmosYi / ReasoningShield
View on GitHub
ReasoningShield: Safety Detection over Reasoning Traces of Large Reasoning Models
☆26Sep 27, 2025Updated 9 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
MinhDucBui / Multi3Hate
View on GitHub
☆15Jan 6, 2025Updated last year
SALT-NLP / search_privacy_risk
View on GitHub
Code for the paper "Searching Privacy Risks in Multi-Agent Systems via Simulation"
☆24Oct 13, 2025Updated 9 months ago
ambergroup-labs / papora
View on GitHub
☆16Apr 6, 2023Updated 3 years ago
multimodalpragmatic / multimodalpragmatic
View on GitHub
☆14Jan 14, 2026Updated 6 months ago
UCSB-AI / SafeKey
View on GitHub
[EMNLP 2025] Official code for the paper "SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning"
☆16May 12, 2026Updated 2 months ago
lmarena / arena-catalog
View on GitHub
☆15Dec 17, 2025Updated 7 months ago
TrustAIRLab / VoiceJailbreakAttack
View on GitHub
Code for Voice Jailbreak Attacks Against GPT-4o.
☆38May 31, 2024Updated 2 years ago
google-parfait / cvm-side-channel-analysis
View on GitHub
☆15Aug 12, 2025Updated 11 months ago
cispa / CacheWarp
View on GitHub
Proof-of-concept implementation for the paper "CacheWarp: Software-based Fault Injection using Selective State Reset" (USENIX Security 20…
☆65Aug 12, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
b-shi / PMC-PMI
View on GitHub
Performance Counter Measurements at the cycle granularity
☆19Jul 9, 2021Updated 5 years ago
renxida / iseedeaduops-poc
View on GitHub
Proof-of-concept for I See Dead Micro-Ops transient execution attack
☆15Nov 3, 2021Updated 4 years ago
SecurityNet-Research / SecurityNet
View on GitHub
☆14Apr 11, 2024Updated 2 years ago
verazuo / Typecho-zanshang
View on GitHub
支持Typecho1.1的赞赏功能代码
☆15Aug 25, 2018Updated 7 years ago
SaFo-Lab / DoxBench
View on GitHub
[ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"
☆30Feb 7, 2026Updated 5 months ago
JingbiaoMei / RGCL
View on GitHub
📄 ACL 2024: RGCL, Retrieval-Guided Contrastive Learning for Hateful Meme Detection 📄 EMNLP 2025 (Oral): RA-HMD, Robust Adaptation of La…
☆40Mar 1, 2026Updated 4 months ago
hydroo / macos-core-to-core-latency
View on GitHub
Core-to-core latency benchmark that works on Apple MacOS without hard affinity
☆20May 9, 2026Updated 2 months ago
s8lvg / rowhammer-revisited-talk
View on GitHub
A list of resources for the talk Rowhammer Revisited: From Exploration to Exploitation and Mitigation
☆15Dec 13, 2023Updated 2 years ago
PositionalHidden / PositionalHidden
View on GitHub
To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …
☆12Jun 18, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SproutNan / AI-Safety_Benchmark
View on GitHub
The official repository for guided jailbreak benchmark
☆31Jul 28, 2025Updated 11 months ago
YitingQu / meme-evolution
View on GitHub
☆16Jul 26, 2024Updated last year
cispa / LLCSliceReversing
View on GitHub
Artifact for the IEEE S&P 2025 paper: "Rapid Reversing of Non-Linear CPU Cache Slice Functions: Unlocking Physical Address Leakage"
☆20Apr 3, 2026Updated 3 months ago
EchoSafe-MLLM / EchoSafe
View on GitHub
[CVPR 2026] Code for Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
☆15Mar 18, 2026Updated 4 months ago
joey1993 / bert-defender
View on GitHub
codes for paper "learning to discriminate perturbations for blocking adversarial attacks in text classification" in EMNLP19
☆15Feb 25, 2020Updated 6 years ago
ShallowU / cs336-assignment
View on GitHub
solution for cs336-assignment1,2,5 , including colab code link and blog link.
☆15Feb 20, 2026Updated 5 months ago
packmad / fprem-anti-emulation
View on GitHub
☆21Mar 20, 2026Updated 4 months ago
tmllab / 2025_ICLR_PiF
View on GitHub
☆40May 17, 2025Updated last year
YancyKahn / CoA
View on GitHub
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
☆39Jan 17, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
eliaskousk / vmrun
View on GitHub
Simple AMD-V (SVM) Virtualization Extensions Demo
☆22Nov 7, 2017Updated 8 years ago
stekhn / got-relationships
View on GitHub
Game of Thrones Relationship Chart
☆13Oct 15, 2019Updated 6 years ago
0xhilbert / Platypus
View on GitHub
Platypus Educational Samples
☆23May 21, 2021Updated 5 years ago
UzL-ITS / tdxdown
View on GitHub
Software Artifacts for the paper "TDXdown: Single-Stepping and Instruction Counting Attacks against Intel TDX"
☆19Oct 14, 2024Updated last year
Jarviswang94 / MMSafetyAwareness
View on GitHub
Multimodal Safety Awareness Benchmark for Large Language Models
☆15Jun 3, 2025Updated last year
hf618 / COSMIC
View on GitHub
☆17Oct 1, 2025Updated 9 months ago
AmourWaltz / UAlign
View on GitHub
Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"
☆15Mar 25, 2025Updated last year