thu-coai / Implicit-Toxicity
Official Code for EMNLP 2023 paper: "Unveiling the Implicit Toxicity in Large Language Models"
☆11 Updated last year
Alternatives and similar repositories for Implicit-Toxicity:
Users who are interested in Implicit-Toxicity are comparing it to the repositories listed below:
- This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark ☆25 Updated 2 years ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding" ☆28 Updated 4 months ago
- ☆43 Updated last year
- ☆25 Updated 2 years ago
- ☆62 Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence fr… ☆41 Updated 2 years ago
- Controlled Text Generation using Prefix-Tuning on GPT ☆19 Updated 2 years ago
- ☆11 Updated last year
- Code for ACL 2022 paper "Semi-Supervised Formality Style Transfer with Consistency Training". ☆17 Updated 2 years ago
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction" ☆17 Updated last year
- Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation ☆18 Updated last year
- ☆25 Updated last year
- ☆52 Updated 7 months ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 Updated last year
- ☆9 Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models ☆80 Updated 6 months ago
- Easy-to-Hard Learning for Information Extraction (ACL 2023 Findings) ☆14 Updated last year
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆43 Updated 2 years ago
- ☆34 Updated last year
- Repository for the AAAI 2022 paper "CEM: Commonsense-aware Empathetic Response Generation" ☆91 Updated last year
- ☆68 Updated 3 months ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering. ☆17 Updated 2 years ago
- https://openreview.net/forum?id=OC1o4_OI6Jw ☆13 Updated 2 years ago
- Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022) ☆31 Updated 2 years ago
- ☆86 Updated last year
- ☆25 Updated 6 months ago
- ☆17 Updated 2 years ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆63 Updated last year
- Awesome LLM for NLG Evaluation Papers ☆23 Updated last year
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models ☆42 Updated 3 years ago