RiverGao / CLiKALinks

Evaluation of the Cross-Lingual Knowledge Alignment in LLMs

☆9

Alternatives and similar repositories for CLiKA

Users that are interested in CLiKA are comparing it to the libraries listed below

Sorting:

RUCAIBox / Language-Specific-Neurons
☆79Updated 7 months ago
qinyiwei / InfoBench
☆55Updated 11 months ago
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆59Updated last year
Betswish / Cross-Lingual-Consistency
Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…
☆25Updated 5 months ago
SeaEval / SeaEval
NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning
☆25Updated 5 months ago
NJUNLP / QAlign
☆38Updated last year
cordercorder / knn-models
A retrieval augmented sequence modeling toolkit implemented based on Fairseq
☆29Updated 2 years ago
ictnlp / TACS
Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts
☆17Updated 11 months ago
HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆68Updated last year
katiekang1998 / llm_hallucinations
☆17Updated last year
GAIR-NLP / alignment-for-honesty
☆74Updated last year
zthang / Focus
☆20Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆62Updated last year
zhliu0106 / probing-lm-data
Official Implementation of "Probing Language Models for Pre-training Data Detection"
☆19Updated 8 months ago
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆114Updated 10 months ago
cylnlp / convsumx
Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation
☆19Updated last year
NJUNLP / knn-box
an easy-to-use knn-mt toolkit
☆104Updated last year
yasumasaonoe / entity_knowledge_propagation
☆17Updated 2 years ago
fanqiwan / Explore-Instruct
EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration
☆36Updated last year
wangcunxiang / QA-Eval
The repository for paper <Evaluating Open-QA Evaluation>
☆25Updated last year
nayeon7lee / FactualityPrompt
☆87Updated 2 years ago
zzhang0179 / Unveiling-Linguistic-Regions-in-LLMs
[ACL 2024] Unveiling Linguistic Regions in Large Language Models
☆31Updated last year
YJiangcm / FollowBench
[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
☆108Updated last month
hexuandeng / Mono4SiMT
The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉
☆12Updated 2 years ago
zhaochen0110 / conflictbank
Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…
☆46Updated 2 months ago
AI21Labs / factor
Code and data for the FACTOR paper
☆51Updated last year
edenbiran / RippleEdits
Evaluating the Ripple Effects of Knowledge Editing in Language Models
☆56Updated last year
Spico197 / MoE-SFT
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
☆40Updated 10 months ago
HKUST-KnowComp / Knowledge-Constrained-Decoding
Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…
☆30Updated last year
iwangjian / TopDial
Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)
☆30Updated last year