Official Code for EMNLP 2023 paper: "Unveiling the Implicit Toxicity in Large Language Models""
☆15Nov 30, 2023Updated 2 years ago
Alternatives and similar repositories for Implicit-Toxicity
Users that are interested in Implicit-Toxicity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆16Apr 3, 2025Updated last year
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).☆76May 17, 2025Updated 10 months ago
- The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark"…☆113May 25, 2025Updated 10 months ago
- ☆14Jun 17, 2024Updated last year
- Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023☆28Jan 25, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- SarcNet: A Multilingual Multimodal Sarcasm Detection Dataset (COLING2024 Oral)☆13Jul 22, 2024Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆22Dec 16, 2024Updated last year
- The unofficial implementation of paper "Facial expression recognition with grid-wise attention and visual transformer"☆17Jul 14, 2022Updated 3 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…☆17Jul 27, 2023Updated 2 years ago
- ☆14Jan 6, 2025Updated last year
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024☆26Jul 7, 2024Updated last year
- Unzipped client files☆11Mar 8, 2020Updated 6 years ago
- Third Person Shooter for Unity☆12Jun 26, 2022Updated 3 years ago
- FAST 22☆12Jul 18, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- All-in-One Safety Evaluation Framwork☆47Mar 4, 2026Updated last month
- ☆26Aug 24, 2022Updated 3 years ago
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)☆10Apr 17, 2023Updated 2 years ago
- ☆14Jan 12, 2022Updated 4 years ago
- CCAC2024——大模型安全的双重防线:少样本文本内容安全挑战赛仓库☆30Jun 20, 2024Updated last year
- modified mdb for examining zfs on disk☆21Aug 12, 2013Updated 12 years ago
- Facebook Hatebook Memes Challenge☆12Jan 28, 2021Updated 5 years ago
- Adding a Randeng translation model on top of the instructBLIP model to enable Chinese testing of instructBLIP functionality.☆16May 30, 2023Updated 2 years ago
- 此Line bot範例為使用 LineBotSDK 建立的 『圖片、人臉辨識 Bot』 用戶可以傳遞照片給 bot ,它會辨識出照片的內容(圖說),以及照片中的人、性別和、年紀....☆24Mar 5, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10Oct 28, 2024Updated last year
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Apr 11, 2024Updated 2 years ago
- Training from scratch a character embedding following Word2Vec, using tensorflow.☆14Mar 24, 2023Updated 3 years ago
- An SLA-Oriented LSM-Tree Key-Value Store for High-end Cloud Data Service☆17Jan 24, 2021Updated 5 years ago
- Replication code for "The Structure of Toxic Conversations on Twitter" (WWW'21)☆10May 25, 2021Updated 4 years ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Apr 6, 2023Updated 3 years ago
- Inference Llama/Llama2/Llama3 Modes in NumPy☆21Nov 22, 2023Updated 2 years ago
- 人工智能实验五:多模态情感分类☆16Jul 14, 2022Updated 3 years ago
- ICDCS 2021, "StripeMerge: Efficient Wide-Stripe Generation for Large-Scale Erasure-Coded Storage"☆13Jul 19, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Character-level Korean ELECTRA Model (음절 단위 한국어 ELECTRA)☆54Jun 12, 2023Updated 2 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆20Aug 20, 2021Updated 4 years ago
- A challenge on Semi-Supervised and Reinforced Task-Oriented Dialog Systems, Co-located with EMNLP2022 SereTOD Workshop☆26Oct 1, 2022Updated 3 years ago
- ☆15Jul 6, 2023Updated 2 years ago
- Paraphrase Generation Using Deep Reinforcement Learning - MSc Thesis☆18Jun 10, 2020Updated 5 years ago
- CCA, DCCA, DCCAE, ConvCCA☆21Dec 16, 2020Updated 5 years ago
- ☆19Jan 17, 2021Updated 5 years ago