facebookresearch / irt-leaderboardView external linksLinks
Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simplicity can mask nuances in evaluation items (examples) and subjects (NLP models). Rather than replace leaderboards, we advocate a re-imagining so that they better highlight if and where progress is made. Buildi…
☆18Mar 30, 2022Updated 3 years ago
Alternatives and similar repositories for irt-leaderboard
Users that are interested in irt-leaderboard are comparing it to the libraries listed below
Sorting:
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Oct 17, 2023Updated 2 years ago
- Automatically detect errors in annotated corpora.☆48Sep 8, 2023Updated 2 years ago
- Black for Python docstrings and reStructuredText (rst).☆18Apr 7, 2023Updated 2 years ago
- DicSin - Dicionário de Sinônimos Português Brasil☆22May 21, 2018Updated 7 years ago
- ☆30Feb 27, 2023Updated 2 years ago
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- A community repository for MI_CLAIM (Minimum Information for CLinical AI Modeling) reporting standards☆23Jan 14, 2021Updated 5 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Nov 21, 2021Updated 4 years ago
- [DEPRECATED] An R package to pre-process bulk EKG data and detect the physiological peaks☆12Aug 22, 2016Updated 9 years ago
- Pequenos projetos e testes simples em linguagem Python.☆11Jan 28, 2018Updated 8 years ago
- A collection of Claude commands and utilities☆24Updated this week
- Workshop Home Page for Benchmarking: Past, Present and Future☆35Sep 26, 2021Updated 4 years ago
- Multiple Instance Learning Networks for Fine-Grained Sentiment Analysis☆35Nov 26, 2018Updated 7 years ago
- Python Version of Andrew Welter's Hatebase Wrapper☆10Feb 20, 2022Updated 3 years ago
- Directed masked autoencoders☆14Feb 5, 2026Updated last week
- This package is archived and further developed under the name RFSurrogates. In this R-package functions are provided to select important …☆11Jul 27, 2023Updated 2 years ago
- Installs and manages Stata programs tracked as git repositories.☆11Sep 13, 2017Updated 8 years ago
- This repository contain a dataset describing the urgency of admission among COVID-19 patients, intended for use in a predictive modeling …☆10Mar 16, 2020Updated 5 years ago
- Slides and code for "Validating Models in R" Strata 2016 RDay http://conferences.oreilly.com/strata/hadoop-big-data-ca/public/schedule/de…☆10Jun 22, 2020Updated 5 years ago
- Stochastic Kronecker Generation in Python, Used in RPI TRUST☆10Dec 13, 2017Updated 8 years ago
- Temporal summarization framework☆10Dec 4, 2023Updated 2 years ago
- An R package containing functions used in the CDC Flu Forecasting competition☆11Oct 18, 2020Updated 5 years ago
- An updated version of eICU Benchmark with an updated problem definition on LoS and Decompensation tasks☆11Aug 12, 2021Updated 4 years ago
- FactNews is the first dataset to predict sentence-level factuality of news reporting. Furthemore, we provide baseline results for sentenc…☆11Jun 12, 2025Updated 8 months ago
- 📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora☆10May 25, 2022Updated 3 years ago
- web programming course (COMPSCI 326, UMass Amherst)☆14Sep 13, 2022Updated 3 years ago
- D-Lab's 2-hour workshop on AI-assisted coding in Visual Studio Code using GitHub Copilot and R.☆14Nov 18, 2025Updated 2 months ago
- Code for evaluating uncertainty estimation methods for Transformer-based architectures in natural language understanding tasks.☆43Aug 16, 2021Updated 4 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆19Jul 18, 2025Updated 7 months ago
- Code for Paper "Effective Multi-agent Reinforcement Learning Control with Relative Entropy Regularization".☆13Sep 27, 2023Updated 2 years ago
- Follow Me: Conversation Planning for Target-driven Recommendation Dialogue Systems☆11Aug 1, 2023Updated 2 years ago
- Plugin to normalize score using Min Max or Z Score normalizer.☆10Mar 25, 2021Updated 4 years ago
- Computing and reproducibility bootcamp for Duke StatSci graduate students.☆11Aug 26, 2016Updated 9 years ago
- Tree interpretation methods based on ranger☆13Jan 30, 2026Updated 2 weeks ago
- Sample size for external validation of a logistic regression based prediction model☆12Sep 20, 2021Updated 4 years ago
- [NeurIPS 2022] disentanglement evaluation robust to model dimension variance.☆10Sep 21, 2022Updated 3 years ago
- ☆11Nov 19, 2020Updated 5 years ago
- Pretrained segmenter models for Portuguese legislative text.☆13Oct 13, 2024Updated last year