claws-lab / XLingEvalLinks
Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"
☆16Updated last year
Alternatives and similar repositories for XLingEval
Users that are interested in XLingEval are comparing it to the libraries listed below
Sorting:
- Token-level Reference-free Hallucination Detection☆96Updated 2 years ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 3 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 3 years ago
- ☆41Updated 2 years ago
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Updated 2 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 3 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆20Updated 2 years ago
- ☆51Updated 2 years ago
- code associated with ACL 2021 DExperts paper☆116Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆42Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆38Updated last year
- ☆92Updated 3 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Updated 3 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆29Updated 3 years ago
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).☆44Updated 4 months ago
- ☆50Updated 2 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆80Updated last year
- [ACL 2020] Towards Debiasing Sentence Representations☆66Updated 2 years ago
- Long-context pretrained encoder-decoder models☆96Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆48Updated last year
- ☆39Updated 2 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated last year
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆79Updated 3 years ago
- FRANK: Factuality Evaluation Benchmark☆59Updated 2 years ago
- ☆71Updated 3 years ago
- Apps built using Inspired Cognition's Critique.☆58Updated 2 years ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models☆141Updated 3 years ago
- To analyze and remove gender bias in coreference resolution systems☆79Updated 4 months ago