TIGER-AI-Lab/TIGERScore

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TIGER-AI-Lab/TIGERScore)

TIGER-AI-Lab / TIGERScore

"TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks" [TMLR 2024]

☆32

Alternatives and similar repositories for TIGERScore

Users that are interested in TIGERScore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xu1998hz / SEScore2
View on GitHub
☆17Mar 3, 2025Updated last year
Betswish / MIRAGE
View on GitHub
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆25Mar 10, 2025Updated last year
Xiaoyu-SZ / LLMasEvaluator
View on GitHub
Large Language Models as Evaluators for Recommendation Explanations (RecSys 2024 Reproducibility)
☆21Aug 13, 2025Updated 11 months ago
hiaoxui / D2T-Grounding
View on GitHub
Learning Latent Semantic Annotations for Grounding Natural Language to Structured Data
☆13Jan 28, 2019Updated 7 years ago
Heidelberg-NLP / CC-SHAP
View on GitHub
Code for "On Measuring Faithfulness of Natural Language Explanations"
☆23Jul 14, 2026Updated last week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
INK-USC / expl-refinement
View on GitHub
Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)
☆11Oct 25, 2021Updated 4 years ago
s-lilo / brat-peek
View on GitHub
Framework for working with brat-annotated .ann files
☆10Mar 16, 2026Updated 4 months ago
jdf-prog / LLM-Gen
View on GitHub
A simple generate script utils using fastchat conv template for generation of Large Language Models
☆21Jun 21, 2023Updated 3 years ago
McGill-NLP / feedbackqa
View on GitHub
FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback
☆12Jul 13, 2022Updated 4 years ago
xu1998hz / InstructScore_SEScore3
View on GitHub
First explanation metric (diagnostic report) for text generation evaluation
☆62Mar 3, 2025Updated last year
GChrysostomou / ood_faith
View on GitHub
☆13Jul 26, 2023Updated 2 years ago
zkxinxin / HCMGNN
View on GitHub
Heterogeneous Causal Metapath Graph Neural Network for Gene-Microbe-Disease Association Prediction
☆12Aug 19, 2024Updated last year
UKPLab / nessie
View on GitHub
Automatically detect errors in annotated corpora.
☆48Sep 8, 2023Updated 2 years ago
shawnricecake / squant
View on GitHub
[ICCAD 2025] Squant
☆15Jul 3, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
archiki / ReCEval
View on GitHub
Supporting code for ReCEval paper
☆32Sep 14, 2024Updated last year
maszhongming / UniEval
View on GitHub
Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation
☆217Feb 10, 2024Updated 2 years ago
jacobkrantz / VertMetric
View on GitHub
VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts.
☆12Dec 20, 2018Updated 7 years ago
yoavgur / PISCES
View on GitHub
🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
☆13Jun 28, 2026Updated 3 weeks ago
jvladika / HealthFC
View on GitHub
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking
☆14Apr 11, 2025Updated last year
TIGER-AI-Lab / StructLM
View on GitHub
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆76Oct 19, 2024Updated last year
heolin / agreement
View on GitHub
Implementation of popular agreement metrics such as Cohen kappa, Fleiss kappa, Krippendorff alpha
☆16Apr 2, 2022Updated 4 years ago
LinguisticAnomalies / pls_retrieval
View on GitHub
Repository for paper CELLS: A Parallel Corpus for Biomedical Lay Language Generation
☆19Apr 2, 2024Updated 2 years ago
Yevgnen / pybrat
View on GitHub
Parser for brat rapid annotation tool.
☆15May 2, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
chaojun-wang / Exposure-Bias-Hallucination-Domain-Shift
View on GitHub
Introduction and scripts for ACL-2020 paper "On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation"
☆21Jun 23, 2020Updated 6 years ago
zhaoshitian / Causal-CoG
View on GitHub
[CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"
☆17Sep 12, 2024Updated last year
bvanaken / clinical-assertion-data
View on GitHub
Dataset for the NLPMC @ NAACL 2021 Paper: Assertion Detection in Clinical Notes: Medical Language Models to the Rescue?
☆16Sep 28, 2021Updated 4 years ago
xiye17 / EvalQAExpl
View on GitHub
Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.
☆17Apr 25, 2021Updated 5 years ago
JHLew / Learnable-Fourier-Features
View on GitHub
Unofficial pytorch implementation of the paper "Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding", NeurIPS 20…
☆13Apr 24, 2024Updated 2 years ago
chorusai / brave
View on GitHub
Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT.
☆15Dec 25, 2019Updated 6 years ago
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15Jun 1, 2026Updated last month
multimodal-art-projection / IV-Bench
View on GitHub
☆14Apr 23, 2025Updated last year
tareknaous / readme
View on GitHub
ReadMe++: A Multi-domain Multilingual Dataset for Readability Assessment
☆13Apr 15, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
princeton-nlp / LLMBar
View on GitHub
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
☆138Jul 8, 2024Updated 2 years ago
visinf / fast-axiomatic-attribution
View on GitHub
Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)
☆15Feb 24, 2026Updated 5 months ago
i-Eval / FairEval
View on GitHub
☆145Sep 10, 2023Updated 2 years ago
liuchengyuan123 / CPAD
View on GitHub
The official dataset of paper "Goal-Oriented Prompt Attack and Safety Evaluation for LLMs".
☆22Feb 5, 2024Updated 2 years ago
Coldmist-Lu / ErrorAnalysis_Prompt
View on GitHub
[ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT
☆91Oct 14, 2025Updated 9 months ago
LLM-MI-Research / Actionable-MI
View on GitHub
☆15Jan 20, 2026Updated 6 months ago
UKPLab / TWEAC-qa-agent-selection
View on GitHub
☆20Apr 16, 2021Updated 5 years ago