Official Code for EMNLP 2023 paper: "Unveiling the Implicit Toxicity in Large Language Models"
☆15Nov 30, 2023Updated 2 years ago
Alternatives and similar repositories for Implicit-Toxicity
Users that are interested in Implicit-Toxicity are comparing it to the libraries listed below.
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆16Apr 3, 2025Updated last year
- The official implementation for the paper Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge.☆15Aug 14, 2023Updated 2 years ago
- The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark"…☆118May 25, 2025Updated 11 months ago
- CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Long Paper)☆17Feb 10, 2025Updated last year
- Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023☆28Jan 25, 2024Updated 2 years ago
- A simple lisp interpreter☆11Apr 19, 2020Updated 6 years ago
- SarcNet: A Multilingual Multimodal Sarcasm Detection Dataset (COLING2024 Oral)☆14Jul 22, 2024Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆22Dec 16, 2024Updated last year
- The unofficial implementation of paper "Facial expression recognition with grid-wise attention and visual transformer"☆17Jul 14, 2022Updated 3 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…☆17Jul 27, 2023Updated 2 years ago
- ☆14Jan 6, 2025Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024☆26Jul 7, 2024Updated last year
- Unzipped client files☆11Mar 8, 2020Updated 6 years ago
- Third Person Shooter for Unity☆12Jun 26, 2022Updated 3 years ago
- A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling☆15Dec 5, 2023Updated 2 years ago
- ☆44Jun 29, 2023Updated 2 years ago
- All-in-One Safety Evaluation Framework☆49Apr 21, 2026Updated 2 weeks ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity☆11Oct 21, 2020Updated 5 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Debug DeepSpeed-Chat step by step in IDE☆10Apr 17, 2023Updated 3 years ago
- ☆14Jan 12, 2022Updated 4 years ago
- CCAC2024: Dual Lines of Defense for Large Model Safety, repository for the Few-Shot Text Content Safety Challenge☆30Jun 20, 2024Updated last year
- modified mdb for examining zfs on disk☆21Aug 12, 2013Updated 12 years ago
- Facebook Hatebook Memes Challenge☆12Jan 28, 2021Updated 5 years ago
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode…☆15May 5, 2024Updated 2 years ago
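- This Line bot example is an "image and face recognition bot" built with LineBotSDK. Users can send a photo to the bot, and it will recognize the photo's content (caption) as well as the people in the photo, their gender, and their age....☆24Mar 5, 2019Updated 7 years ago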
- ☆10Oct 28, 2024Updated last year
- ☆17Mar 23, 2021Updated 5 years ago
- ☆12Oct 20, 2020Updated 5 years ago
- Training from scratch a character embedding following Word2Vec, using tensorflow.☆14Mar 24, 2023Updated 3 years ago
- An SLA-Oriented LSM-Tree Key-Value Store for High-end Cloud Data Service☆17Jan 24, 2021Updated 5 years ago
- Code for the paper "You Truly Understand What I Need : Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona" which i…☆23Apr 6, 2023Updated 3 years ago
- Artificial Intelligence Lab 5: Multimodal Sentiment Classification☆16Jul 14, 2022Updated 3 years ago
- ☆11Oct 16, 2023Updated 2 years ago
- Dataset and code implementation for the paper "Decoding the Underlying Meaning of Multimodal Hateful Memes" (IJCAI'23).☆20Jun 15, 2023Updated 2 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆20Aug 20, 2021Updated 4 years ago
- A challenge on Semi-Supervised and Reinforced Task-Oriented Dialog Systems, Co-located with EMNLP2022 SereTOD Workshop☆26Oct 1, 2022Updated 3 years ago