[USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
☆14Mar 1, 2025Updated last year
Alternatives and similar repositories for HateBench
Users that are interested in HateBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"☆28Feb 7, 2026Updated 4 months ago
- 2023秋PKU编译原理lab,以及Koopa IR C++接口的文档☆16Feb 12, 2024Updated 2 years ago
- The official repository for guided jailbreak benchmark☆30Jul 28, 2025Updated 11 months ago
- Code for Voice Jailbreak Attacks Against GPT-4o.☆38May 31, 2024Updated 2 years ago
- Bayesian Statistics: From Concept to Data Analysis by University of California, Santa Cruz☆22Nov 11, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 🦊 DISINFOX is a threat intelligence exchange platform for disinformation implementing the DISARM framework at its core.☆52Jul 21, 2025Updated 11 months ago
- ☆14Dec 17, 2025Updated 6 months ago
- ☆15Mar 5, 2026Updated 3 months ago
- Embedded Rust Projects☆13Jun 12, 2024Updated 2 years ago
- [ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …☆38Oct 23, 2024Updated last year
- The final project of PKU course Compilers: Principles in Spring 2023, a SysY to RISC-V compiler. Document::https://pku-minic.github.io/on…☆21Jun 15, 2023Updated 3 years ago
- Batch downloader and Scraper for Pico-8 carts.☆18Aug 21, 2025Updated 10 months ago
- ☆14Apr 11, 2024Updated 2 years ago
- An exploration of LLM steering☆28Jun 15, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 支持Typecho1.1的赞赏功能代码☆15Aug 25, 2018Updated 7 years ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆12Jun 18, 2024Updated 2 years ago
- Benchmark and sample code for the Author Paper Identification Challenge on Kaggle, a part of the 2013 KDD Cup☆33May 23, 2013Updated 13 years ago
- A Keyboard Pad, supported 8 keys + 4 retray encoders or 12 keys.☆12Dec 16, 2023Updated 2 years ago
- ☆31Aug 5, 2015Updated 10 years ago
- ☆16Jul 26, 2024Updated last year
- codes for paper "learning to discriminate perturbations for blocking adversarial attacks in text classification" in EMNLP19☆15Feb 25, 2020Updated 6 years ago
- Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM☆39Jan 17, 2025Updated last year
- A dataset consists of 6,387 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 666 jailbreak prompts).☆22Feb 21, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆39May 17, 2025Updated last year
- Dark Flavored - Academic Project Website Template☆17Sep 30, 2024Updated last year
- Game of Thrones Relationship Chart☆13Oct 15, 2019Updated 6 years ago
- Templates for paper submissions, technical questionnaires, etc.☆14Sep 13, 2024Updated last year
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- ☆79Jun 3, 2026Updated 3 weeks ago
- [USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models☆52Jan 11, 2025Updated last year
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- ☆17Feb 17, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆17Nov 18, 2024Updated last year
- ☆10Jan 16, 2022Updated 4 years ago
- 爬虫项目,用来爬取huntr网站的cve相关信息☆12May 10, 2023Updated 3 years ago
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆14Nov 17, 2023Updated 2 years ago
- [NDSS'25] The official implementation of safety misalignment.☆19Jan 8, 2025Updated last year
- ☆27Jan 5, 2026Updated 5 months ago
- (Very) basic standalone synths☆21Jul 15, 2024Updated last year