MSR-LIT / MultilingualBias
☆10Updated last year
Alternatives and similar repositories for MultilingualBias
Users who are interested in MultilingualBias are comparing it to the repositories listed below.
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆55Updated 4 years ago
- FRANK: Factuality Evaluation Benchmark☆55Updated 2 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year
- [ACL 2020] Towards Debiasing Sentence Representations☆66Updated 2 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".☆88Updated 3 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"☆62Updated last year
- Code base of In-Context Learning for Dialogue State tracking☆45Updated last year
- A benchmark dataset for evaluating dialog system and natural language generation metrics.☆37Updated 2 years ago
- code associated with ACL 2021 DExperts paper☆115Updated 2 years ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆23Updated 2 months ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆66Updated 2 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆39Updated 2 years ago
- Code for ACL 2022 paper "Semi-Supervised Formality Style Transfer with Consistency Training".☆17Updated 3 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- Dataset + classifier tools to study social perception biases in natural language generation☆69Updated last year
- Benchmarking Generalization to New Tasks from Natural Language Instructions☆26Updated 3 years ago
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆12Updated 2 years ago
- Prompt-and-Rerank: A Method for Zero-Shot and Few-Shot Textual Style Transfer☆34Updated 2 years ago
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆71Updated last year