microsoft / adaptive-testing
Find and fix bugs in natural language machine learning models using adaptive testing.
☆183Updated last year
Alternatives and similar repositories for adaptive-testing
Users who are interested in adaptive-testing are comparing it to the libraries listed below.
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆177Updated 3 years ago
- The Python library with command-line tools for interacting with Dynabench (https://dynabench.org/), such as uploading models.☆55Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- A diff tool for language models☆42Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- A framework for few-shot evaluation of autoregressive language models.☆104Updated 2 years ago
- Robustness Gym is an evaluation toolkit for machine learning.☆440Updated 2 years ago
- A Python package for benchmarking interpretability techniques on Transformers.☆213Updated 8 months ago
- Annotated corpus + evaluation metrics for text anonymisation☆56Updated last year
- ☆76Updated 3 years ago
- Stanford's Alexa Prize socialbot☆133Updated last year
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them"☆203Updated 3 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆313Updated last year
- Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆464Updated 2 years ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆149Updated last year
- ☆65Updated last year
- Interpretable Evaluation for AI Systems☆367Updated 2 years ago
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆73Updated 10 months ago
- Adversarial Natural Language Inference Benchmark☆396Updated 3 years ago
- Code and Data for Evaluation WG☆41Updated 3 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆129Updated last year
- Evaluation suite for large-scale language models.☆126Updated 3 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆178Updated last year
- Question-answers, collected from Google☆129Updated 3 years ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆157Updated 2 years ago
- Utilities for the HuggingFace transformers library☆68Updated 2 years ago
- ☆97Updated 2 years ago
- Inquisitive Parrots for Search☆193Updated 2 weeks ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆573Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data; it should also work with any Hugging Face text dataset.☆93Updated 2 years ago