nlee0212 / BLEnD
BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages
☆28Updated 3 months ago
Alternatives and similar repositories for BLEnD:
Users who are interested in BLEnD are comparing it to the libraries listed below
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆12Updated 2 years ago
- ☆32Updated last week
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆70Updated 10 months ago
- Code for ACL 2022 paper "Semi-Supervised Formality Style Transfer with Consistency Training".☆17Updated 2 years ago
- Script to pre-train Hugging Face Transformers BART with TensorFlow 2☆33Updated last year
- ☆38Updated last year
- ☆10Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- ☆20Updated last year
- ☆10Updated 2 years ago
- Code associated with the ACL 2021 DExperts paper☆114Updated last year
- ☆9Updated 3 years ago
- AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 2 years ago
- ☆10Updated 6 months ago
- ☆58Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆41Updated 6 months ago
- ☆15Updated last week
- PyTorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- Benchmarking Commonsense Reasoning in Real-World Tasks☆12Updated last year
- The model implementations for T5 encoder decoder soft prompt tuning for text generation.☆24Updated 2 years ago
- ☆29Updated last year
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆14Updated 11 months ago
- Fairlex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing☆12Updated last year
- ☆26Updated 5 months ago
- Multicultural Proverbs and Sayings☆11Updated 2 months ago
- ACL 2023 short: Balancing Lexical and Semantic Quality in Abstractive Summarization☆15Updated last year
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)☆10Updated last year
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆24Updated last year
- The geometry of multilingual language model representations (EMNLP 2022).☆20Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago