HrishikeshVish / Fairpy
☆23Updated 6 months ago
Alternatives and similar repositories for Fairpy:
Users that are interested in Fairpy are comparing it to the libraries listed below
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆131Updated 2 months ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆71Updated 3 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 2 years ago
- ☆38Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- Token-level Reference-free Hallucination Detection☆94Updated last year
- ☆124Updated last year
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆21Updated 2 months ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 2 years ago
- Repository for the Bias Benchmark for QA dataset.☆101Updated last year
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Updated last year
- ☆44Updated last year
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆19Updated last year
- A curated list of research papers and resources on Cultural LLM.☆36Updated 4 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated last year
- ☆47Updated last year
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆111Updated 11 months ago
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models☆41Updated 2 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 2 years ago
- [ACL 2020] Towards Debiasing Sentence Representations☆64Updated 2 years ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆14Updated 10 months ago
- ☆25Updated 3 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆54Updated 3 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆63Updated 7 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆57Updated last year
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings)☆20Updated 3 years ago
- ☆173Updated 6 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆83Updated 6 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆100Updated 2 years ago
- On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification☆30Updated 2 years ago