siddheshih / culture-awareness-llms
☆18Updated last year
Alternatives and similar repositories for culture-awareness-llms
Users who are interested in culture-awareness-llms are comparing it to the repositories listed below
- Crosslingual Reasoning through Test-Time Scaling☆19Updated 7 months ago
- A curated list of research papers and resources on Cultural LLM.☆52Updated last year
- ☆19Updated 9 months ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆42Updated 4 months ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆15Updated last year
- ☆47Updated 2 months ago
- Official repository for "Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory" accepted at EMNLP Find…☆31Updated last year
- Code for the paper "HALoGEN: Fantastic LLM Hallucinations and Where To Find Them"☆23Updated 7 months ago
- [NeurIPS 2025] "Reasoning Models Better Express Their Confidence"☆21Updated last month
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 9 months ago
- ☆34Updated last year
- ☆89Updated 11 months ago
- Code and data for the FACTOR paper☆52Updated 2 years ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆95Updated last year
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆40Updated last year
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆25Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆34Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆133Updated last year
- ☆17Updated 2 years ago
- ☆22Updated last year
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆74Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆38Updated 2 years ago
- ☆34Updated 8 months ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆33Updated last year
- HANNA, a large annotated dataset of Human-ANnotated NArratives for ASG evaluation.☆36Updated last year
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆13Updated 3 years ago
- ☆21Updated 2 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Updated 9 months ago
- Awesome LLM for NLG Evaluation Papers☆25Updated last year
- Investigating Cultural Alignment of Large Language Models☆12Updated last year