kojima-takeshi188 / lang_neuronLinks
☆20Updated last year
Alternatives and similar repositories for lang_neuron
Users that are interested in lang_neuron are comparing it to the libraries listed below
Sorting:
- ☆41Updated last year
- Crosslingual Reasoning through Test-Time Scaling☆19Updated 6 months ago
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆26Updated 8 months ago
- The geometry of multilingual language model representations (EMNLP 2022).☆22Updated 3 years ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆27Updated 3 months ago
- ☆43Updated 2 years ago
- ☆87Updated 11 months ago
- Measuring the Mixing of Contextual Information in the Transformer☆33Updated 2 years ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆15Updated last year
- an easy-to-use knn-mt toolkit☆105Updated 2 years ago
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq☆29Updated 2 years ago
- ☆15Updated 3 years ago
- ☆17Updated 2 years ago
- ☆29Updated 11 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆42Updated last year
- ☆86Updated 3 years ago
- ☆62Updated 3 years ago
- Official codebase for “In-Context Learning with Many Demonstration Examples”☆16Updated 2 years ago
- ☆38Updated last year
- Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation☆19Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆63Updated last year
- Constrained Decoding Project☆19Updated 2 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Updated 2 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆61Updated last year
- Monitoring the health of ARR☆26Updated last month
- Multilingual Large Language Models Evaluation Benchmark☆133Updated last year
- ☆79Updated last year
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 8 months ago
- ☆82Updated 2 years ago
- ☆20Updated last year