nlee0212 / BLEnD
BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages
☆19Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for BLEnD
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆11Updated 2 years ago
- Resources for cultural NLP research☆67Updated this week
- This is official code for the NAACL 2021 paper: "MelBERT: Metaphor Detection via Contextualized Late Interaction usingMetaphorical Identi…☆43Updated last year
- ☆38Updated last year
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆20Updated last year
- The geometry of multilingual language model representations (EMNLP 2022).☆15Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆23Updated last month
- ☆12Updated 8 months ago
- Prompt-and-Rerank: A Method for Zero-Shot and Few-Shot Textual Style Transfer☆33Updated 2 years ago
- ☆40Updated 3 years ago
- Models for automatically transforming toxic text to neutral☆33Updated last year
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆27Updated 3 weeks ago
- WorldCuisines is an extensive multilingual and multicultural benchmark that spans 30 languages, covering a wide array of global cuisines.☆12Updated 3 weeks ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Updated 3 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆31Updated 2 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆27Updated 2 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 2 years ago
- Multicultural Proverbs and Sayings☆10Updated 6 months ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆38Updated 2 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated last year
- ☆25Updated 2 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43Updated last year
- ☆58Updated 2 years ago
- Detect hallucinated tokens for conditional sequence generation.☆63Updated 2 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆31Updated 3 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆60Updated last year
- ☆25Updated 6 months ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆33Updated 2 years ago
- The model implementations for T5 encoder decoder soft prompt tuning for text generation.☆25Updated last year