hlt-mt / pangolinn
Just as a pangolin looks for bugs and catches them, the goal of this library is to help developers find bugs in their neural networks and newly created models.
☆13Updated last year
Alternatives and similar repositories for pangolinn
Users who are interested in pangolinn compare it to the libraries listed below.
- The geometry of multilingual language model representations (EMNLP 2022).☆21Updated 2 years ago
- Crosslingual Reasoning through Test-Time Scaling☆17Updated 3 weeks ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆23Updated 2 months ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆14Updated last year
- ☆18Updated 11 months ago
- ☆20Updated 6 months ago
- Measuring the Mixing of Contextual Information in the Transformer☆29Updated 2 years ago
- ☆26Updated 2 years ago
- Can Large Language Models Be an Alternative to Human Evaluations?☆9Updated last year
- ☆82Updated 2 years ago
- ☆31Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆11Updated last year
- ☆35Updated 11 months ago
- Multilingual Large Language Models Evaluation Benchmark☆122Updated 9 months ago
- ☆31Updated 4 months ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)☆53Updated 3 weeks ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆16Updated last year
- Implementation of "Can we obtain significant success in RST discourse parsing by using Large Language Models?" (accepted by EACL 2024)☆15Updated last year
- FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…☆15Updated 2 years ago
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆25Updated 3 months ago
- ☆15Updated 2 years ago
- Code for our paper "Graph Pre-training for AMR Parsing and Generation" in ACL2022☆98Updated last year
- Split bib files for anthology bibliography for overleaf☆10Updated 9 months ago
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models☆47Updated last year
- Codebase, data and models for the SummaC paper in TACL☆94Updated 4 months ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆82Updated 4 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆78Updated last year
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Updated 2 years ago
- The dataset and code for PeerSum at EMNLP'23.☆14Updated last year