facebookresearch / irt-leaderboard
Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simplicity can mask nuances in evaluation items (examples) and subjects (NLP models). Rather than replace leaderboards, we advocate a re-imagining so that they better highlight if and where progress is made. Buildi…
☆17 Updated 2 years ago
Alternatives and similar repositories for irt-leaderboard:
Users who are interested in irt-leaderboard are comparing it to the libraries listed below.
- ☆24 Updated 5 years ago
- ☆19 Updated 5 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer ☆39 Updated 4 years ago
- Bayesian Assessment of Hypotheses ☆24 Updated last year
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant. ☆31 Updated 4 years ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18. ☆42 Updated 5 years ago
- Converter from UD-trees to BART representation ☆36 Updated last year
- ☆29 Updated last year
- Participant Kit for the TextGraphs-15 Shared Task on Explanation Regeneration ☆19 Updated 3 years ago
- ☆15 Updated 4 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding. ☆50 Updated 3 years ago
- Post-editing Datasets by Rakuten (PEDRa) ☆14 Updated 3 years ago
- ☆38 Updated 4 years ago
- ☆24 Updated 4 years ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME) ☆22 Updated 4 years ago
- Chu-Lui-Edmonds decoding extracted from TurboParser ☆13 Updated 7 years ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations" ☆29 Updated 4 years ago
- Implementation of Nested Named Entity Recognition using Flair ☆24 Updated 3 years ago
- Code for the paper "Latent Relation Language Models" at AAAI-20. ☆41 Updated 4 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias ☆18 Updated 4 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit ☆57 Updated last year
- This repo contains datasets and code for Assessing Phrasal Representation and Composition in Transformers, by Lang Yu and Allyson Ettinge… ☆11 Updated 3 years ago
- TextGraphs-13 Shared Task on Multi-Hop Inference Explanation Regeneration ☆44 Updated 5 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu). ☆20 Updated 2 years ago
- Python code for training models in the ACL paper, "Simple and Effective Paraphrastic Similarity from Parallel Translations". ☆22 Updated 5 years ago
- ☆32 Updated 3 years ago
- Statistics on multilingual datasets ☆17 Updated 2 years ago
- ☆12 Updated 4 years ago
- Efficient Sentence Embedding via Semantic Subspace Analysis ☆14 Updated 5 years ago
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing" ☆20 Updated 4 years ago