[USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
☆13Mar 1, 2025Updated last year
Alternatives and similar repositories for HateBench
Users that are interested in HateBench are comparing it to the libraries listed below
- Releasing open-sourced version of the code used in the paper "Perceptron-based Prefetch Filtering (ISCA 2019)"☆10May 27, 2022Updated 3 years ago
- PKU Compilers course lab (Fall 2023), plus documentation for the Koopa IR C++ interface☆16Feb 12, 2024Updated 2 years ago
- The official repository for guided jailbreak benchmark☆29Jul 28, 2025Updated 7 months ago
- Code for Voice Jailbreak Attacks Against GPT-4o.☆37May 31, 2024Updated last year
- Bayesian Statistics: From Concept to Data Analysis by University of California, Santa Cruz☆21Nov 11, 2017Updated 8 years ago
- 🦊 DISINFOX is a threat intelligence exchange platform for disinformation implementing the DISARM framework at its core.☆51Jul 21, 2025Updated 7 months ago
- ☆14Dec 17, 2025Updated 3 months ago
- Embedded Rust Projects☆13Jun 12, 2024Updated last year
- [ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …☆36Oct 23, 2024Updated last year
- Batch downloader and Scraper for Pico-8 carts.☆18Aug 21, 2025Updated 6 months ago
- The final project of PKU course Compilers: Principles in Spring 2023, a SysY to RISC-V compiler. Document::https://pku-minic.github.io/on…☆21Jun 15, 2023Updated 2 years ago
- ☆14Apr 11, 2024Updated last year
- An exploration of LLM steering☆25Jun 15, 2024Updated last year
- Code for a tipping (donation) feature supporting Typecho 1.1☆15Aug 25, 2018Updated 7 years ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- Benchmark and sample code for the Author Paper Identification Challenge on Kaggle, a part of the 2013 KDD Cup☆33May 23, 2013Updated 12 years ago
- A keyboard pad supporting 8 keys + 4 rotary encoders, or 12 keys.☆12Dec 16, 2023Updated 2 years ago
- ☆31Aug 5, 2015Updated 10 years ago
- ☆14Jul 26, 2024Updated last year
- ☆39May 17, 2025Updated 10 months ago
- Code for the paper "Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification" (EMNLP 2019)☆15Feb 25, 2020Updated 6 years ago
- ☆42Nov 16, 2024Updated last year
- A dataset consisting of 6,387 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 666 jailbreak prompts).☆19Feb 21, 2024Updated 2 years ago
- Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM☆39Jan 17, 2025Updated last year
- Dark Flavored - Academic Project Website Template☆17Sep 30, 2024Updated last year
- Game of Thrones Relationship Chart☆13Oct 15, 2019Updated 6 years ago
- Templates for paper submissions, technical questionnaires, etc.☆14Sep 13, 2024Updated last year
- [USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models☆51Jan 11, 2025Updated last year
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated 11 months ago
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- ☆16Feb 17, 2025Updated last year
- A curated list of Soft Robotics resources, projects, courses, books, video lectures, papers, journals and articles.☆16Mar 5, 2024Updated 2 years ago
- ☆16Nov 18, 2024Updated last year
- ☆11Jan 16, 2022Updated 4 years ago
- ICML2025: One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework☆14Jun 24, 2025Updated 8 months ago
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆14Nov 17, 2023Updated 2 years ago
- A web crawler project for scraping CVE-related information from the huntr website☆12May 10, 2023Updated 2 years ago
- [NDSS'25] The official implementation of safety misalignment.☆17Jan 8, 2025Updated last year
- ☆23Jan 5, 2026Updated 2 months ago