microsoft / adaptive-testingLinks
Find and fix bugs in natural language machine learning models using adaptive testing.
☆183Updated last year
Alternatives and similar repositories for adaptive-testing
Users that are interested in adaptive-testing are comparing it to the libraries listed below
Sorting:
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆177Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Question-answers, collected from Google☆129Updated 3 years ago
- A framework for few-shot evaluation of autoregressive language models.☆102Updated 2 years ago
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.☆55Updated 2 years ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆150Updated last year
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆572Updated last year
- Annotated corpus + evaluation metrics for text anonymisation☆56Updated last year
- Robustness Gym is an evaluation toolkit for machine learning.☆440Updated 2 years ago
- ☆98Updated 2 years ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)☆61Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- A diff tool for language models☆42Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆308Updated 2 years ago
- ☆72Updated 2 years ago
- ☆78Updated last year
- Pretrained Language Models for Source code☆253Updated 4 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆178Updated last year
- ☆76Updated 3 years ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆81Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆211Updated 8 months ago
- AI Data Management & Evaluation Platform☆215Updated last year
- ☆65Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- ☆133Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆254Updated last year
- Stanford's Alexa Prize socialbot☆133Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆155Updated last year
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)☆310Updated 2 years ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆464Updated 2 years ago