chongyangtao/LLMs-for-NLG-Evaluation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chongyangtao/LLMs-for-NLG-Evaluation)

chongyangtao / LLMs-for-NLG-Evaluation

Awesome LLM for NLG Evaluation Papers

☆26

Alternatives and similar repositories for LLMs-for-NLG-Evaluation

Users that are interested in LLMs-for-NLG-Evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

isle-dev / MetricEval
View on GitHub
MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…
☆12Nov 6, 2023Updated 2 years ago
overfit-brothers / KRX-2024
View on GitHub
☆12Dec 20, 2024Updated last year
SiyuanWangw / ULogic
View on GitHub
☆23Aug 1, 2024Updated last year
SeoroMin / Prompt4LLM-Eval
View on GitHub
☆19Nov 26, 2023Updated 2 years ago
kakao / diatool-dpo
View on GitHub
☆15Aug 25, 2025Updated 11 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
awslabs / durepa-hybrid-qa
View on GitHub
☆12Mar 22, 2024Updated 2 years ago
thu-coai / OpenMEVA
View on GitHub
Benchmark for evaluating open-ended generation
☆50Nov 6, 2024Updated last year
ulab-uiuc / GraphEval
View on GitHub
[ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You
☆17Mar 18, 2025Updated last year
HDKG / HTKG
View on GitHub
SIGIR 2022 CODE
☆10Apr 1, 2022Updated 4 years ago
SiyuanWangw / StepwiseQA
View on GitHub
The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".
☆22Sep 1, 2022Updated 3 years ago
sayakpaul / BiT-jax2tf
View on GitHub
This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.
☆14Dec 21, 2021Updated 4 years ago
ClustProject / KUDataMultitasklearning
View on GitHub
☆25Nov 24, 2023Updated 2 years ago
meituan / vitabench
View on GitHub
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
☆23Oct 17, 2025Updated 9 months ago
zycdev / AISO
View on GitHub
Authors' implementation of the paper Adaptive Information Seeking for Open-Domain Question Answering, published in EMNLP 2021.
☆39May 16, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sb-jang / kodialogbench
View on GitHub
Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…
☆18Apr 15, 2025Updated last year
maj34 / Legal_Specific_KoLLM
View on GitHub
[ Text Analytics ] 법률 도메인 특화 한국어 기반 LLM 개발
☆15Sep 14, 2025Updated 10 months ago
donggyukimc / Inverse-cloze-task
View on GitHub
Test code of Inverse cloze task for information retrieval
☆33Jan 10, 2021Updated 5 years ago
hist0613 / arxivbot
View on GitHub
☆61Updated this week
K-Kuyama / yet-another-UI-for-AW
View on GitHub
UI for ActivityWatch. Include category editor and viewer for multiple categorizations.
☆10Jan 31, 2024Updated 2 years ago
LostCow / KLUE
View on GitHub
KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)
☆25Apr 11, 2022Updated 4 years ago
nlx-group / Shortcutted-Commonsense-Reasoning
View on GitHub
Code for the article "Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning", Outstanding Paper at EMNLP20…
☆10Nov 7, 2021Updated 4 years ago
detule / linux-hexagon
View on GitHub
Linux kernel for Qualcomm's Hexagon processors
☆10Mar 14, 2013Updated 13 years ago
DaoD / KPN
View on GitHub
SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals
☆11Jul 30, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jzbjyb / lm-calibration
View on GitHub
☆34Nov 17, 2021Updated 4 years ago
keiji / region_cropper
View on GitHub
Help creating image dataset for machine learning.
☆10Nov 4, 2020Updated 5 years ago
HKUST-KnowComp / MICO
View on GitHub
This is the code repo for Findings of EMNLP2022 paper: MICO: a multi-alternative contrastive learning framework for commonsense knowledg…
☆10Nov 29, 2022Updated 3 years ago
uakarsh / TiLT-Implementation
View on GitHub
Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.
☆18Apr 23, 2023Updated 3 years ago
yukyunglee / transformers-resources
View on GitHub
huggingface transformers tutorial, code, resources
☆26Apr 7, 2024Updated 2 years ago
activatedgeek / calibration-tuning
View on GitHub
☆53Apr 9, 2025Updated last year
parameterlab / apricot
View on GitHub
Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024
☆22Nov 20, 2024Updated last year
MySong7NLPer / Presentations-Notes
View on GitHub
Presentations & Notes
☆11May 14, 2022Updated 4 years ago
jiacheng-ye / kg_gater
View on GitHub
[EMNLP 2021] Code for our EMNLP 2021 paper “Heterogeneous Graph Neural Networks for Keyphrase Generation”
☆14Nov 13, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
yilunzhao / Awsome-Table-Reasoning
View on GitHub
A comprehensive paper list of Reasoning over Tables.
☆30Nov 6, 2022Updated 3 years ago
smilegate-ai / HuLiC
View on GitHub
☆93Mar 3, 2022Updated 4 years ago
hyintell / RetrievalQA
View on GitHub
Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…
☆68May 28, 2024Updated 2 years ago
muhaochen / bilingual_dictionaries
View on GitHub
This repository contains the source code and links to some datasets used in the CoNLL 2019 paper "Learning to Represent Bilingual Diction…
☆12Oct 1, 2020Updated 5 years ago
boostcampaitech2 / final-project-level3-nlp-08
View on GitHub
Look, Attend and Generate Poem - 사진을 보고 시를 써내려가는 감성시인 서비스
☆25Jan 20, 2022Updated 4 years ago
MySong7NLPer / AI-Conference-Acceptance-Rate
View on GitHub
☆11Aug 8, 2022Updated 3 years ago
heyLinsir / Semantic-based-QA
View on GitHub
Code of "A Semantic-based Method for Unsupervised Commonsense Question Answering"
☆14Jul 29, 2021Updated 5 years ago