Beyond Accuracy: Behavioral Testing of NLP models with CheckList
β2,050Jan 9, 2024Updated 2 years ago
Alternatives and similar repositories for checklist
Users that are interested in checklist are comparing it to the libraries listed below
Sorting:
- TextAttack π is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocsβ¦β3,379Jul 10, 2025Updated 8 months ago
- Data augmentation for NLPβ4,652Jun 24, 2024Updated last year
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic β¦β3,641Mar 11, 2026Updated last week
- Robustness Gym is an evaluation toolkit for machine learning.β445Jun 28, 2022Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in cβ¦β359Feb 22, 2022Updated 4 years ago
- An open-source NLP research library, built on PyTorch.β11,893Nov 22, 2022Updated 3 years ago
- jiant is an nlp toolkitβ1,674Jul 6, 2023Updated 2 years ago
- Library for Knowledge Intensive Language Tasksβ970Mar 31, 2022Updated 3 years ago
- Fast, general, and tested differentiable structured prediction in PyTorchβ1,124Apr 20, 2022Updated 3 years ago
- Papers & presentation materials from Hugging Face's internal science dayβ2,054Oct 31, 2020Updated 5 years ago
- A python tool for evaluating the quality of sentence embeddings.β2,105Mar 19, 2024Updated 2 years ago
- NL-Augmenter π¦ β π A Collaborative Repository of Natural Language Transformationsβ786May 19, 2024Updated last year
- State-of-the-Art Text Embeddingsβ18,390Mar 12, 2026Updated last week
- Shared repository for open-sourced projects from the Google AI Language team.β1,760Updated this week
- Multi-Task Deep Neural Networks for Natural Language Understandingβ2,257Mar 7, 2024Updated 2 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"β1,626Jun 12, 2023Updated 2 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the moβ¦β22,975Jul 28, 2024Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"β6,494Jan 14, 2026Updated 2 months ago
- Longformer: The Long-Document Transformerβ2,189Feb 8, 2023Updated 3 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understandingβ6,176May 28, 2023Updated 2 years ago
- LAnguage Model Analysisβ1,390Jul 7, 2024Updated last year
- ACL2020 Tutorial: Open-Domain Question Answeringβ835Jan 1, 2021Updated 5 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)β14,354Oct 27, 2025Updated 4 months ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining theβ¦β2,088Aug 15, 2024Updated last year
- A system for quickly generating training data with weak supervisionβ5,937May 2, 2024Updated last year
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020β606Jun 4, 2020Updated 5 years ago
- FastFormers - highly efficient transformer models for NLUβ709Mar 21, 2025Updated 11 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.β2,927Feb 14, 2023Updated 3 years ago
- BertViz: Visualize Attention in Transformer Modelsβ7,954Jan 8, 2026Updated 2 months ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821β3,643Oct 16, 2024Updated last year
- π Scalable embedding, reasoning, ranking for images and sentences with CLIPβ12,824Jan 23, 2024Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,190Sep 30, 2025Updated 5 months ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generatorsβ2,371Mar 23, 2024Updated last year
- BERT score for text generationβ1,880Jul 30, 2024Updated last year
- BLEURT is a metric for Natural Language Generation based on transfer learning.β788Aug 4, 2023Updated 2 years ago
- Interpretable Evaluation for AI Systemsβ366Mar 10, 2023Updated 3 years ago
- Must-read Papers on pre-trained language models.β3,363Nov 6, 2022Updated 3 years ago
- Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processingβ652Sep 27, 2022Updated 3 years ago
- BERT-related papersβ2,039Aug 12, 2023Updated 2 years ago