Beyond Accuracy: Behavioral Testing of NLP models with CheckList
β2,048Jan 9, 2024Updated 2 years ago
Alternatives and similar repositories for checklist
Users that are interested in checklist are comparing it to the libraries listed below
Sorting:
- TextAttack π is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocsβ¦β3,364Jul 10, 2025Updated 7 months ago
- Data augmentation for NLPβ4,645Jun 24, 2024Updated last year
- Robustness Gym is an evaluation toolkit for machine learning.β445Jun 28, 2022Updated 3 years ago
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic β¦β3,636Feb 20, 2026Updated last week
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in cβ¦β359Feb 22, 2022Updated 4 years ago
- An open-source NLP research library, built on PyTorch.β11,889Nov 22, 2022Updated 3 years ago
- Library for Knowledge Intensive Language Tasksβ965Mar 31, 2022Updated 3 years ago
- jiant is an nlp toolkitβ1,674Jul 6, 2023Updated 2 years ago
- Fast, general, and tested differentiable structured prediction in PyTorchβ1,123Apr 20, 2022Updated 3 years ago
- Papers & presentation materials from Hugging Face's internal science dayβ2,052Oct 31, 2020Updated 5 years ago
- Shared repository for open-sourced projects from the Google AI Language team.β1,749Feb 20, 2026Updated last week
- A python tool for evaluating the quality of sentence embeddings.β2,106Mar 19, 2024Updated last year
- Multi-Task Deep Neural Networks for Natural Language Understandingβ2,258Mar 7, 2024Updated last year
- State-of-the-Art Text Embeddingsβ18,298Feb 20, 2026Updated last week
- Longformer: The Long-Document Transformerβ2,188Feb 8, 2023Updated 3 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"β6,490Jan 14, 2026Updated last month
- NL-Augmenter π¦ β π A Collaborative Repository of Natural Language Transformationsβ786May 19, 2024Updated last year
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"β1,628Jun 12, 2023Updated 2 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the moβ¦β22,981Jul 28, 2024Updated last year
- LAnguage Model Analysisβ1,390Jul 7, 2024Updated last year
- XLNet: Generalized Autoregressive Pretraining for Language Understandingβ6,176May 28, 2023Updated 2 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)β14,359Oct 27, 2025Updated 4 months ago
- FastFormers - highly efficient transformer models for NLUβ709Mar 21, 2025Updated 11 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.β2,924Feb 14, 2023Updated 3 years ago
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020β606Jun 4, 2020Updated 5 years ago
- ACL2020 Tutorial: Open-Domain Question Answeringβ835Jan 1, 2021Updated 5 years ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining theβ¦β2,083Aug 15, 2024Updated last year
- A system for quickly generating training data with weak supervisionβ5,940May 2, 2024Updated last year
- BertViz: Visualize Attention in Transformer Modelsβ7,921Jan 8, 2026Updated last month
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generatorsβ2,371Mar 23, 2024Updated last year
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.β1,752Dec 20, 2023Updated 2 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821β3,641Oct 16, 2024Updated last year
- π Scalable embedding, reasoning, ranking for images and sentences with CLIPβ12,817Jan 23, 2024Updated 2 years ago
- BERT score for text generationβ1,873Jul 30, 2024Updated last year
- BLEURT is a metric for Natural Language Generation based on transfer learning.β786Aug 4, 2023Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,170Sep 30, 2025Updated 5 months ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 tyβ¦β650Jan 4, 2023Updated 3 years ago
- Must-read Papers on pre-trained language models.β3,362Nov 6, 2022Updated 3 years ago
- Entity Linker solutionβ1,206Sep 21, 2023Updated 2 years ago