[USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
☆14Mar 1, 2025Updated last year
Alternatives and similar repositories for HateBench
Users that are interested in HateBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Releasing open-sourced version of the code used in the paper "Perceptron-based Prefetch Filtering (ISCA 2019)"☆10May 27, 2022Updated 3 years ago
- [ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"☆26Feb 7, 2026Updated 2 months ago
- 2023秋PKU编译原理lab,以及Koopa IR C++接 口的文档☆16Feb 12, 2024Updated 2 years ago
- The official repository for guided jailbreak benchmark☆29Jul 28, 2025Updated 9 months ago
- Code for Voice Jailbreak Attacks Against GPT-4o.☆38May 31, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Bayesian Statistics: From Concept to Data Analysis by University of California, Santa Cruz☆22Nov 11, 2017Updated 8 years ago
- 🦊 DISINFOX is a threat intelligence exchange platform for disinformation implementing the DISARM framework at its core.☆51Jul 21, 2025Updated 9 months ago
- ☆13Dec 17, 2025Updated 4 months ago
- ☆14Mar 5, 2026Updated last month
- Embedded Rust Projects☆13Jun 12, 2024Updated last year
- [ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …☆35Oct 23, 2024Updated last year
- Batch downloader and Scraper for Pico-8 carts.☆18Aug 21, 2025Updated 8 months ago
- The final project of PKU course Compilers: Principles in Spring 2023, a SysY to RISC-V compiler. Document::https://pku-minic.github.io/on…☆21Jun 15, 2023Updated 2 years ago
- An exploration of LLM steering☆26Jun 15, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆14Apr 11, 2024Updated 2 years ago
- 支持Typecho1.1的赞赏功能代码☆15Aug 25, 2018Updated 7 years ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- Benchmark and sample code for the Author Paper Identification Challenge on Kaggle, a part of the 2013 KDD Cup☆33May 23, 2013Updated 12 years ago
- A Keyboard Pad, supported 8 keys + 4 retray encoders or 12 keys.☆12Dec 16, 2023Updated 2 years ago
- ☆31Aug 5, 2015Updated 10 years ago
- ☆15Jul 26, 2024Updated last year
- codes for paper "learning to discriminate perturbations for blocking adversarial attacks in text classification" in EMNLP19☆15Feb 25, 2020Updated 6 years ago
- Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM☆39Jan 17, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A dataset consists of 6,387 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 666 jailbreak prompts).☆20Feb 21, 2024Updated 2 years ago
- ☆39May 17, 2025Updated 11 months ago
- ☆42Nov 16, 2024Updated last year
- Dark Flavored - Academic Project Website Template☆17Sep 30, 2024Updated last year
- Game of Thrones Relationship Chart☆13Oct 15, 2019Updated 6 years ago
- Templates for paper submissions, technical questionnaires, etc.☆14Sep 13, 2024Updated last year
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- [USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models☆51Jan 11, 2025Updated last year
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Feb 17, 2025Updated last year
- ☆16Nov 18, 2024Updated last year
- ☆11Jan 16, 2022Updated 4 years ago
- 爬虫项目,用来爬取huntr网站的cve相关信息☆12May 10, 2023Updated 2 years ago
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆14Nov 17, 2023Updated 2 years ago
- [NDSS'25] The official implementation of safety misalignment.☆19Jan 8, 2025Updated last year
- ☆23Jan 5, 2026Updated 3 months ago