dmg-illc/JUDGE-BENCH

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dmg-illc/JUDGE-BENCH)

dmg-illc / JUDGE-BENCH

☆40

Alternatives and similar repositories for JUDGE-BENCH

Users that are interested in JUDGE-BENCH are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amith-ananthram / feelingblue
View on GitHub
FeelingBlue: A Corpus for Understanding the Emotional Connotation of Color in Context, accepted at TACL 2022, presented at ACL 2023
☆13Dec 28, 2023Updated 2 years ago
personads / depprobe
View on GitHub
Probing for Labeled Dependency Trees (ACL 2022) + Sorting LMs by Structure (NAACL 2022)
☆10Jun 11, 2024Updated 2 years ago
xiye17 / EvalQAExpl
View on GitHub
Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.
☆17Apr 25, 2021Updated 5 years ago
turkish-nlp-suite / Turkish-Wiki-NER-Dataset
View on GitHub
Repo for Turkish Wiki NER dataset.
☆13Jul 11, 2023Updated 3 years ago
1171-jpg / MARVEL_AVR
View on GitHub
Github repo for MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning
☆18Jun 12, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hectormartinez / ud_unsup_parser
View on GitHub
☆22Jun 22, 2022Updated 4 years ago
sunlab-osu / ReasonBERT
View on GitHub
Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021
☆28Feb 1, 2023Updated 3 years ago
ekinakyurek / influence
View on GitHub
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆40Dec 27, 2022Updated 3 years ago
cambridgeltl / ACL2022_tutorial_multilingual_dialogue
View on GitHub
Materials for "Natural Language Processing for Multilingual Task-Oriented Dialogue" Tutorial at ACL 2022
☆14May 21, 2022Updated 4 years ago
jjzha / cartography-al
View on GitHub
Code base for the EMNLP 2021 Findings paper: Cartography Active Learning
☆14Jun 3, 2025Updated last year
Leukas / CUTE
View on GitHub
☆20Apr 26, 2026Updated 3 months ago
AaltoML / PeriodicBNN
View on GitHub
Code for 'Periodic Activation Functions Induce Stationarity' (NeurIPS 2021)
☆21Oct 27, 2021Updated 4 years ago
INK-USC / expl-refinement
View on GitHub
Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)
☆11Oct 25, 2021Updated 4 years ago
yanaiela / TNE
View on GitHub
codebase for the Text-based NP Enrichment (TNE) paper
☆19Mar 12, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
GuillaumeDD / dialign
View on GitHub
Automatic and generic measures of verbal alignment in dyadic dialogue based on sequential pattern mining at the level of surface of text …
☆13May 11, 2025Updated last year
UKPLab / nessie
View on GitHub
Automatically detect errors in annotated corpora.
☆48Sep 8, 2023Updated 2 years ago
RLG-Leiden / edugym
View on GitHub
☆15Sep 22, 2023Updated 2 years ago
timoschick / am-for-bert
View on GitHub
This repository contains the WordNet Language Model Probing (WNLaMPro) dataset introduced in "Rare Words: A Major Problem for Contextuali…
☆14Feb 2, 2020Updated 6 years ago
GChrysostomou / ood_faith
View on GitHub
☆13Jul 26, 2023Updated 3 years ago
pavelsof / ipavec
View on GitHub
IPA alignment using vector representations
☆11Mar 30, 2020Updated 6 years ago
kayoyin / interpret-lm
View on GitHub
Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)
☆63May 12, 2022Updated 4 years ago
mlcommons / dynabench
View on GitHub
☆29Feb 11, 2026Updated 5 months ago
marcos452 / SemGes
View on GitHub
This official GitHub repository for SemGes: Semantics-aware Co-Speech Gesture Generation using Semantic Coherence and Relevance Learning(…
☆22Aug 8, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OpenNLPLab / HGRN2
View on GitHub
HGRN2: Gated Linear RNNs with State Expansion
☆58Aug 20, 2024Updated last year
Doraemonzzz / hgru2-pytorch
View on GitHub
☆24Sep 25, 2024Updated last year
AI4Bharat / FBI
View on GitHub
FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists
☆31Aug 14, 2025Updated 11 months ago
cambridgeltl / zepo
View on GitHub
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)
☆14Oct 3, 2024Updated last year
rycolab / revisiting-uid
View on GitHub
Analysis pipeline for Revisiting UID (EMNLP 2021)
☆12Oct 24, 2022Updated 3 years ago
yoavgur / PISCES
View on GitHub
🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
☆13Jun 28, 2026Updated last month
jvladika / HealthFC
View on GitHub
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking
☆14Apr 11, 2025Updated last year
yahshibu / nested-ner-tacl2020-flair
View on GitHub
Implementation of Nested Named Entity Recognition using Flair
☆24Oct 29, 2021Updated 4 years ago
mainlp / awesome-human-label-variation
View on GitHub
A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …
☆102Apr 15, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
microsoft / multimodal-aligned-recipe-corpus
View on GitHub
☆18Jun 5, 2024Updated 2 years ago
GuyTevet / diversity-eval
View on GitHub
Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"
☆21Feb 23, 2021Updated 5 years ago
aagohary / canard
View on GitHub
Repo for the question-in-context rewriting baseline presented in Elgohary et al. "Can you unpack that? Learning to rewrite questions-in-c…
☆24May 20, 2020Updated 6 years ago
juliarodina / RuSemShift
View on GitHub
Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t…
☆14Feb 21, 2025Updated last year
pmandera / semspaces
View on GitHub
Semantic spaces in python
☆14Jul 6, 2023Updated 3 years ago
david-gimeno / tailored-avsr
View on GitHub
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆15Feb 24, 2025Updated last year
smartschat / art
View on GitHub
Approximate randomization testing.
☆19Apr 17, 2020Updated 6 years ago