RiverGao / CLiKALinks
Evaluation of the Cross-Lingual Knowledge Alignment in LLMs
☆9Updated last year
Alternatives and similar repositories for CLiKA
Users that are interested in CLiKA are comparing it to the libraries listed below
Sorting:
- ☆75Updated 6 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆26Updated this week
- ☆54Updated 10 months ago
- ☆37Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆23Updated 3 months ago
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration☆36Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆34Updated 10 months ago
- [ACL 2023] kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation☆16Updated last year
- ☆74Updated last year
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts☆17Updated 9 months ago
- Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation☆19Updated last year
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆39Updated 8 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆65Updated last year
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆19Updated 6 months ago
- This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023).☆14Updated last year
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆46Updated last month
- ☆17Updated last year
- Official codebase for “In-Context Learning with Many Demonstration Examples”☆16Updated 2 years ago
- ☆21Updated last year
- ☆18Updated last year
- ☆25Updated this week
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆62Updated 11 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆55Updated 11 months ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Updated last year
- [NeurIPS 2024] Can Language Models Learn to Skip Steps?☆18Updated 4 months ago
- Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…☆30Updated last year
- ☆44Updated last year
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆25Updated 3 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆55Updated last year