SALT-NLP / CultureBankLinks
☆45Updated last year
Alternatives and similar repositories for CultureBank
Users that are interested in CultureBank are comparing it to the libraries listed below
Sorting:
- ☆18Updated 5 months ago
- Multilingual Large Language Models Evaluation Benchmark☆129Updated last year
- Repository for the Bias Benchmark for QA dataset.☆126Updated last year
- ☆81Updated 8 months ago
- A curated list of research papers and resources on Cultural LLM.☆46Updated 11 months ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 5 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- Code and data for the FACTOR paper☆51Updated last year
- ☆184Updated last month
- ☆20Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆207Updated last year
- ☆29Updated 8 months ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆25Updated 2 weeks ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆16Updated 2 years ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆79Updated 4 years ago
- Codebase, data and models for the SummaC paper in TACL☆99Updated 6 months ago
- ☆77Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆38Updated last year
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆15Updated last year
- templates and other documents regarding responsible NLP research☆70Updated 2 years ago
- ☆15Updated 2 years ago
- ☆75Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆255Updated 2 years ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆49Updated last year
- Token-level Reference-free Hallucination Detection☆96Updated 2 years ago
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)☆30Updated last year
- Data and code for the paper "Inducing Positive Perspectives with Text Reframing"☆61Updated 2 years ago
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models☆47Updated last year
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆25Updated 5 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆114Updated 11 months ago