dxlong2000 / FormatBiasEval
[Preprint '24] LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
☆11 · Updated last year
Alternatives and similar repositories for FormatBiasEval
Users who are interested in FormatBiasEval are comparing it to the libraries listed below
- ☆20 · Updated 2 years ago
- code for "GLEN: General-Purpose Event Detection for Thousands of Types" ☆13 · Updated last year
- ☆15 · Updated 2 years ago
- ☆30 · Updated 10 months ago
- ☆43 · Updated 2 years ago
- A comprehensive paper list of Reasoning over Tables. ☆29 · Updated 2 years ago
- Awesome LLM for NLG Evaluation Papers ☆25 · Updated last year
- FRANK: Factuality Evaluation Benchmark ☆59 · Updated 2 years ago
- HANNA, a large annotated dataset of Human-ANnotated NArratives for ASG evaluation. ☆35 · Updated last year
- Codebase, data and models for the SummaC paper in TACL ☆102 · Updated 8 months ago
- The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understandi… ☆17 · Updated last year
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents" ☆63 · Updated 2 years ago
- ☆10 · Updated 2 years ago
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)" ☆63 · Updated 2 years ago
- [ACL 2023] S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering ☆20 · Updated 4 months ago
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data" ☆32 · Updated 6 months ago
- Codes for ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning" ☆31 · Updated 2 years ago
- ☆24 · Updated last year
- ☆65 · Updated 10 months ago
- ☆15 · Updated 4 years ago
- ☆51 · Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models. ☆149 · Updated 2 months ago
- This repository contains source code for the PASTA model, a pre-trained language model for table-based fact verification. ☆17 · Updated 2 years ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske… ☆125 · Updated last year
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction" ☆17 · Updated 2 years ago
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023 ☆65 · Updated last month
- Code for the ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning" ☆56 · Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆41 · Updated last year
- Prompt-and-Rerank: A Method for Zero-Shot and Few-Shot Textual Style Transfer ☆36 · Updated 3 years ago
- Code for the paper "Open Domain Question Answering with A Unified Knowledge Interface" (ACL 2022) ☆56 · Updated 2 years ago