Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (supports LLaMA, BLOOM, mT5, RoBERTa, etc.). Paper: https://aclanthology.org/2023.emnlp-main.658/
☆27 Aug 8, 2025 Updated 7 months ago
Alternatives and similar repositories for Cross-Lingual-Consistency
Users who are interested in Cross-Lingual-Consistency are comparing it to the libraries listed below.
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran… ☆18 Oct 18, 2022 Updated 3 years ago
- [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly ☆25 Jan 6, 2026 Updated 2 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models ☆11 Jan 19, 2024 Updated 2 years ago
- Transfer learning for neural machine translation using cross-lingual word embeddings ☆10 Dec 17, 2025 Updated 2 months ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment ☆11 Apr 6, 2025 Updated 11 months ago
- A repo for LLM jailbreak ☆14 Sep 5, 2023 Updated 2 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages". ☆13 Sep 17, 2021 Updated 4 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant. ☆31 May 11, 2020 Updated 5 years ago
- ☆15 Mar 8, 2024 Updated 2 years ago
- A simple, Python-based, command-line runner for MGIZA++. ☆10 Mar 24, 2022 Updated 3 years ago
- A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining ☆18 Nov 26, 2023 Updated 2 years ago
- ☆48 Jan 21, 2024 Updated 2 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases. ☆23 Sep 4, 2024 Updated last year
- The geometry of multilingual language model representations (EMNLP 2022). ☆22 Oct 21, 2022 Updated 3 years ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects ☆23 Jan 26, 2025 Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆96 Aug 18, 2023 Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023 ☆106 Apr 20, 2024 Updated last year
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?" ☆26 Jun 3, 2025 Updated 9 months ago
- ☆21 Feb 13, 2023 Updated 3 years ago
- Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation ☆26 Sep 30, 2022 Updated 3 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco… ☆36 Aug 29, 2025 Updated 6 months ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. ☆88 Sep 12, 2024 Updated last year
- ☆267 Aug 1, 2025 Updated 7 months ago
- Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019) ☆33 Oct 11, 2022 Updated 3 years ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra… ☆33 Sep 15, 2021 Updated 4 years ago
- ☆31 Apr 27, 2022 Updated 3 years ago
- ☆10 Oct 2, 2024 Updated last year
- ☆14 Jul 12, 2025 Updated 7 months ago
- URIEL+ knowledge base for natural language processing ☆17 Dec 16, 2025 Updated 2 months ago
- Simple-to-use scoring function for arbitrarily tokenized texts. ☆47 Feb 19, 2025 Updated last year
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization ☆41 Mar 7, 2025 Updated last year
- code for Teaching LM to Translate with Comparison ☆39 Dec 15, 2023 Updated 2 years ago
- ☆14 Aug 7, 2019 Updated 6 years ago
- Concise Reasoning via Reinforcement Learning ☆13 Apr 16, 2025 Updated 10 months ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec… ☆10 Apr 14, 2025 Updated 10 months ago
- Digitale Geisteswissenschaften rund um Graphentechnologien ☆10 Feb 12, 2026 Updated 3 weeks ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 Nov 9, 2021 Updated 4 years ago
- Poetry Corpora Annotated on Aesthetic Emotions ☆12 Aug 2, 2022 Updated 3 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices ☆10 Aug 3, 2020 Updated 5 years ago