nyu-mll/quality

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nyu-mll/quality)

nyu-mll / quality

☆152

Alternatives and similar repositories for quality

Users that are interested in quality are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

redwoodresearch / interp
View on GitHub
Redwood Research's transformer interpretability tools
☆15Apr 15, 2022Updated 4 years ago
allenai / natural-perturbations
View on GitHub
Natural Perturbation for Robust Question Answering
☆12Apr 7, 2020Updated 6 years ago
OriShapira / LitePyramids
View on GitHub
Method for evaluating system summaries manually, via crowdsourcing, using a summarization dataset that includes reference summaries.
☆12May 5, 2019Updated 7 years ago
tau-nlp / scrolls
View on GitHub
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
☆69Jan 12, 2024Updated 2 years ago
neuralmind-ai / information-extraction-t5
View on GitHub
☆12Apr 29, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
google-research / longt5
View on GitHub
☆183May 26, 2023Updated 3 years ago
psunlpgroup / MACSum
View on GitHub
Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.
☆34Jul 25, 2023Updated 3 years ago
google-research / bleurt
View on GitHub
BLEURT is a metric for Natural Language Generation based on transfer learning.
☆794Aug 4, 2023Updated 2 years ago
mbollmann / sonnet-finder
View on GitHub
Finds snippets in iambic pentameter in English-language text and tries to combine them to a rhyming sonnet.
☆13Jan 5, 2023Updated 3 years ago
martiansideofthemoon / relic-retrieval
View on GitHub
Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).
☆20May 14, 2022Updated 4 years ago
tau-nlp / zero_scrolls
View on GitHub
Running inference on the ZeroSCROLLS benchmark
☆22Apr 18, 2024Updated 2 years ago
SimengSun / ChapterBreak
View on GitHub
☆12Jun 5, 2024Updated 2 years ago
unicamp-dl / Lite-T5-Translation
View on GitHub
☆27Jan 23, 2024Updated 2 years ago
inspired-cognition / critique-apps
View on GitHub
Apps built using Inspired Cognition's Critique.
☆56Mar 6, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
feyzaakyurek / bbnli
View on GitHub
Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…
☆15Apr 28, 2022Updated 4 years ago
facebookresearch / irt-leaderboard
View on GitHub
Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp…
☆18Mar 30, 2022Updated 4 years ago
ZhangShiyue / Lite2-3Pyramid
View on GitHub
☆16May 22, 2022Updated 4 years ago
oaimli / PeerSum
View on GitHub
The dataset and code for PeerSum at EMNLP'23.
☆16Oct 20, 2025Updated 9 months ago
ChicagoHAI / decsum
View on GitHub
Implementation for Decision-focused Summarization (EMNLP2021)
☆12Mar 14, 2022Updated 4 years ago
bcdnlp / PRD
View on GitHub
PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations
☆12Apr 21, 2024Updated 2 years ago
deep-spin / hallucinations-in-nmt
View on GitHub
☆20Jan 16, 2024Updated 2 years ago
facebookresearch / contriever
View on GitHub
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
☆780Apr 7, 2023Updated 3 years ago
AIRC-KETI / Korean-Copora
View on GitHub
☆14Dec 9, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cryingjin / AMIOK
View on GitHub
[제 11회 투빅스 컨퍼런스] AM I OK ? - 전문의 답변 기반 심리진단 AI
☆12Jan 19, 2021Updated 5 years ago
allenai / contrast-sets
View on GitHub
☆60Jun 12, 2023Updated 3 years ago
utcsnlp / lfqa_discourse
View on GitHub
A repository for ACL 2022 paper "How do we answer complex questions: Discourse structure of long form answers"
☆19May 31, 2025Updated last year
facebookresearch / anli
View on GitHub
Adversarial Natural Language Inference Benchmark
☆402May 12, 2022Updated 4 years ago
tuhinjubcse / CreativeNLG
View on GitHub
A Repo to curate all creative NLG papers
☆82Aug 9, 2022Updated 3 years ago
IlanPrice / DCTpS
View on GitHub
Code for testing DCT plus Sparse (DCTpS) networks
☆14Jun 15, 2021Updated 5 years ago
unicamp-dl / ExaRanker
View on GitHub
☆29Feb 2, 2024Updated 2 years ago
guilhermemr04 / scaling-zero-shot-retrieval
View on GitHub
No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval
☆29Sep 26, 2022Updated 3 years ago
disrpt / sharedtask2023
View on GitHub
Repository for DISRPT2023 shared task
☆17Jul 26, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
google-deepmind / dangerous-capability-evaluations
View on GitHub
☆73Jun 16, 2026Updated last month
patverga / plant_jones
View on GitHub
☆11Nov 10, 2015Updated 10 years ago
microsoft / text-to-sql-schema-expansion-generalization
View on GitHub
Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion
☆13Jul 26, 2023Updated 3 years ago
carriex / lfqa_eval
View on GitHub
ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"
☆21Mar 22, 2024Updated 2 years ago
ShuyangCao / hibrids_summ
View on GitHub
Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".
☆13May 24, 2022Updated 4 years ago
lifu-tu / ENGINE
View on GitHub
ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation
☆25Oct 2, 2020Updated 5 years ago
johnswentworth / tracelang
View on GitHub
Read, write and manipulate code which reads, writes and manipulates code.
☆11Mar 15, 2020Updated 6 years ago