google-research-datasets/seahorse

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-research-datasets/seahorse)

google-research-datasets / seahorse

Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 quality dimensions: comprehensibility, repetition, grammar, attribution, main idea(s), and conciseness, covering 6 languages, 9 systems and 4 datasets.

☆90

Alternatives and similar repositories for seahorse

Users that are interested in seahorse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dialogue-evaluation / RuCoCo-2023
View on GitHub
Russian coreference resolution competition
☆10Mar 24, 2023Updated 3 years ago
yunan4nlp / E-NNRSTParser
View on GitHub
A neural RST discourse parser with well pre-trained XLNet.
☆17Jun 13, 2022Updated 4 years ago
ZhangShiyue / extractive_is_not_faithful
View on GitHub
☆17May 19, 2023Updated 3 years ago
amir-zeldes / rst2dep
View on GitHub
Converter for Rhetorical Structure Theory (RST) trees to dependency representation
☆17Aug 21, 2025Updated 11 months ago
google-research-datasets / QAmeleon
View on GitHub
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Aug 15, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Yale-LILY / SummEval
View on GitHub
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
☆415Jun 23, 2024Updated 2 years ago
RiTUAL-MBZUAI / DA_NER
View on GitHub
“Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition” (EMNLP 2022)
☆16Feb 2, 2023Updated 3 years ago
disrpt / sharedtask2023
View on GitHub
Repository for DISRPT2023 shared task
☆17Jul 26, 2024Updated last year
machamp-nlp / machamp
View on GitHub
Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/
☆91Jun 3, 2026Updated last month
jacobeisenstein / unix-sociolinguistics
View on GitHub
How (but not why) to do Twitter sociolinguistic analysis in the Unix Shell
☆10Apr 19, 2016Updated 10 years ago
cindyxinyiwang / expand-via-lexicon-based-adaptation
View on GitHub
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆29Apr 2, 2022Updated 4 years ago
gsarti / pecore
View on GitHub
Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑
☆16Apr 18, 2024Updated 2 years ago
nazneenrajani / interpreting-ml-models-course
View on GitHub
Course for Interpreting ML Models
☆52Feb 16, 2023Updated 3 years ago
Yao-Dou / LENS
View on GitHub
☆25May 11, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cisnlp / GlotCC
View on GitHub
[NeurIPS 2024] 🕸 GlotCC Dataset and Pipline
☆21Apr 6, 2025Updated last year
OnionGrief / Chipollino
View on GitHub
преобразования регулярных выражений и конечных автоматов
☆21Feb 26, 2025Updated last year
thu-coai / OpenMEVA
View on GitHub
Benchmark for evaluating open-ended generation
☆50Nov 6, 2024Updated last year
tnq177 / witwicky
View on GitHub
Witwicky: An implementation of Transformer in PyTorch.
☆22Aug 17, 2020Updated 5 years ago
google-research-datasets / swim-ir
View on GitHub
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆50Nov 13, 2023Updated 2 years ago
jzbjyb / X-FACTR
View on GitHub
☆24Jun 12, 2023Updated 3 years ago
ruizheng20 / robust_ticket
View on GitHub
Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022)
☆20Jul 18, 2022Updated 4 years ago
seq-to-mind / coref_dial_summ
View on GitHub
One implementation of the paper "Coreference-Aware Dialogue Summarization".
☆20Nov 9, 2023Updated 2 years ago
webis-de / summary-workbench
View on GitHub
Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.
☆33May 13, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
pavel-blinov / RuMedBench
View on GitHub
https://arxiv.org/abs/2201.06499
☆29Apr 9, 2024Updated 2 years ago
littlehacker26 / Discriminator-Cooperative-Unlikelihood-Prompt-Tuning
View on GitHub
The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…
☆27Nov 13, 2023Updated 2 years ago
lamethods / labook-code
View on GitHub
☆16Apr 21, 2024Updated 2 years ago
fajri91 / discourse_probing
View on GitHub
Discourse Probing of Pretrained Language Models. In Proceedings of NAACL 2021.
☆10Jun 27, 2022Updated 4 years ago
jennhu / lm-pragmatics
View on GitHub
Code and data for "A fine-grained comparison of pragmatic language understanding in humans and language models"
☆11Dec 14, 2022Updated 3 years ago
dig-team / hanna-benchmark-asg
View on GitHub
HANNA, a large annotated dataset of Human-ANnotated NArratives for ASG evaluation.
☆38Oct 15, 2024Updated last year
rudinger / defeasible-nli
View on GitHub
Defeasible Natural Language Inference
☆14Dec 4, 2020Updated 5 years ago
Helsinki-NLP / MuCoW
View on GitHub
Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation
☆18Jan 18, 2021Updated 5 years ago
salesforce / localization-xml-mt
View on GitHub
A High-Quality Multilingual Dataset for Structured Documentation Translation
☆39May 1, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
kwchurch / ACL2022_deepnets_tutorial
View on GitHub
Materials for ACL-2022 tutorial: A Gentle Introduction to Deep Nets and Opportunities for the Future
☆17May 24, 2022Updated 4 years ago
bloomberg / entsum
View on GitHub
Open Source / ENTSUM: A Data Set for Entity-Centric Extractive Summarization
☆29May 23, 2022Updated 4 years ago
rpellerin / raspberry-pi-home-automation
View on GitHub
How to build a security camera with a Raspberry Pi
☆10Jun 24, 2026Updated 3 weeks ago
seq-to-mind / DMRST_Parser
View on GitHub
One implementation of the paper "DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing".
☆43Jan 26, 2023Updated 3 years ago
sileod / DiscSense
View on GitHub
Automated Semantic Analysis of Discourse Markers
☆11May 30, 2022Updated 4 years ago
vidhishanair / FactEdit
View on GitHub
☆14Aug 30, 2023Updated 2 years ago
tagoyal / factuality-datasets
View on GitHub
☆46May 26, 2023Updated 3 years ago