GEM-benchmark/GEM-metrics

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GEM-benchmark/GEM-metrics)

GEM-benchmark / GEM-metrics

Automatic metrics for GEM tasks

☆69

Alternatives and similar repositories for GEM-metrics

Users that are interested in GEM-metrics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ufal / perin
View on GitHub
PERIN is Permutation-Invariant Semantic Parser developed for MRP 2020
☆45Aug 26, 2022Updated 3 years ago
jmhessel / pycocoevalcap
View on GitHub
Python 3 support for the MS COCO caption evaluation tools
☆14Jun 14, 2024Updated 2 years ago
teddysum / Korean_SC_2023
View on GitHub
☆10Oct 28, 2024Updated last year
leo-liuzy / probe-across-time
View on GitHub
☆22Aug 31, 2021Updated 4 years ago
jungokasai / twist_decoding
View on GitHub
☆30May 20, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
krishnap25 / mauve-experiments
View on GitHub
☆38May 14, 2024Updated 2 years ago
jeknov / EMNLP_17_submission
View on GitHub
The dataset and statistical analysis code released with the submission of EMNLP 2017 paper "Why We Need New Evaluation Metrics for NLG"
☆19Nov 16, 2021Updated 4 years ago
danieldeutsch / repro
View on GitHub
Repro is a library for easily running code from published papers via Docker.
☆42Sep 22, 2023Updated 2 years ago
convei-lab / BotsTalk
View on GitHub
🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…
☆16Oct 7, 2024Updated last year
google / BEGIN-dataset
View on GitHub
A benchmark dataset for evaluating dialog system and natural language generation metrics.
☆39Jun 13, 2022Updated 4 years ago
jxhe / self-training-text-generation
View on GitHub
Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"
☆46Jun 30, 2022Updated 4 years ago
harsh19 / Structured-Adversary
View on GitHub
"Learning Rhyming Constraints using Structured Adversaries. Jhamtani H., Mehta S., Carbonell J., Berg-Kirkpatrick T. EMNLP-IJCNLP (Short …
☆11Mar 17, 2020Updated 6 years ago
amy-hyunji / Generative-Multihop-Retrieval
View on GitHub
☆33Mar 31, 2023Updated 3 years ago
dreasysnail / CoCon
View on GitHub
Consistent dialogue generation
☆16Oct 26, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
joowon-dm-snu / fastcampus-chatgpt-intro-frameworks
View on GitHub
☆19Nov 7, 2023Updated 2 years ago
qkaren / unsup_gen_for_cms_reasoning
View on GitHub
☆49Jun 12, 2023Updated 3 years ago
soheeyang / unified-prompt-selection
View on GitHub
[TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis
☆11Nov 14, 2024Updated last year
alexa / alexa-with-dstc10-track2-dataset
View on GitHub
DSTC10 Track 2 - Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations
☆64Jul 25, 2023Updated 3 years ago
alexa / Commonsense-Dialogues
View on GitHub
A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.
☆80Sep 15, 2021Updated 4 years ago
Tiiiger / templm
View on GitHub
Code release for "TempLM: Distilling Language Models into Template-Based Generators"
☆14Jul 21, 2022Updated 4 years ago
kaistAI / How-Well-Do-LLMs-Truly-Ground
View on GitHub
☆11Sep 19, 2025Updated 10 months ago
ufal / augpt
View on GitHub
DSTC9 Submission
☆16Apr 12, 2021Updated 5 years ago
rycolab / uid-decoding
View on GitHub
☆42Mar 8, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HarshTrivedi / phd-advice
View on GitHub
A list of advisory blogs and resources that I have found useful so far.
☆22Nov 25, 2020Updated 5 years ago
iitmnlp / Dialogue-Evaluation-with-BERT
View on GitHub
☆31Jan 16, 2021Updated 5 years ago
Breakend / EthicsInDialogue
View on GitHub
☆18Feb 14, 2018Updated 8 years ago
Gringham / explainable-metrics-machine-translation
View on GitHub
explainable-machine-translation-metrics
☆12Jul 15, 2022Updated 4 years ago
wwxu21 / AMR-SG
View on GitHub
☆20Sep 17, 2021Updated 4 years ago
michaelachmann / gpt-cost-estimator
View on GitHub
A cost estimator for OpenAI API calls in tqdm loops.
☆20Nov 25, 2024Updated last year
facebookresearch / perfect
View on GitHub
PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models
☆109Dec 16, 2025Updated 7 months ago
human-rights-corpus / HRC
View on GitHub
#인권코퍼스
☆31Oct 6, 2023Updated 2 years ago
haven-jeon / KoGPT2-subtasks
View on GitHub
NSMC, KorSTS ... fine-tunings
☆18Feb 23, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
doc2dial / sharedtask-dialdoc2021
View on GitHub
doc2dial data includes a set of documents from multiple domains; and conversations between an assisting agent and an end user that are gr…
☆41Jan 8, 2022Updated 4 years ago
kyle8581 / DialogueCoT
View on GitHub
[EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)
☆11Nov 15, 2023Updated 2 years ago
muhaochen / bilingual_dictionaries
View on GitHub
This repository contains the source code and links to some datasets used in the CoNLL 2019 paper "Learning to Represent Bilingual Diction…
☆12Oct 1, 2020Updated 5 years ago
anthonywchen / MOCHA
View on GitHub
Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".
☆16May 3, 2022Updated 4 years ago
GEM-benchmark / NL-Augmenter
View on GitHub
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
☆786May 19, 2024Updated 2 years ago
google / airdialogue
View on GitHub
☆47Apr 24, 2022Updated 4 years ago
adapter-hub / Hub
View on GitHub
ARCHIVED. Please use https://docs.adapterhub.ml/huggingface_hub.html || 🔌 A central repository collecting pre-trained adapter modules
☆69May 26, 2024Updated 2 years ago