g8a9 / ferret
A Python package for benchmarking interpretability techniques on Transformers.
☆212 · Updated 6 months ago
Alternatives and similar repositories for ferret
Users who are interested in ferret are comparing it to the libraries listed below:
- Weakly Supervised End-to-End Learning (NeurIPS 2021) ☆156 · Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools ☆145 · Updated last year
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 · Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter ☆108 · Updated last year
- Annotated corpus + evaluation metrics for text anonymisation ☆55 · Updated last year
- A Python package to run inference with HuggingFace language and vision-language checkpoints, wrapping many convenient features. ☆27 · Updated 7 months ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists" ☆48 · Updated 2 years ago
- [DEPRECATED] Adapt Transformer-based language models to new text domains ☆87 · Updated last year
- Find and fix bugs in natural language machine learning models using adaptive testing. ☆183 · Updated 11 months ago
- Code repository for the NAACL 2022 paper "ExSum: From Local Explanations to Model Understanding" ☆64 · Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆333 · Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 · Updated 2 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆202 · Updated 2 years ago
- Course for Interpreting ML Models ☆52 · Updated 2 years ago
- Interpretability for sequence generation models 🐛 🔍 ☆412 · Updated 5 months ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 · Updated last year
- Active Learning for Text Classification in Python ☆613 · Updated 2 weeks ago
- Efficient Attention for Long Sequence Processing ☆93 · Updated last year
- just a bunch of useful embeddings for scikit-learn pipelines ☆497 · Updated last month
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆188 · Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆153 · Updated 11 months ago
- A curated list of programmatic weak supervision papers and resources ☆190 · Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆108 · Updated 11 months ago
- SPEAR: Programmatically label and build training data quickly. ☆106 · Updated 9 months ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆92 · Updated last year
- Creating class-based TF-IDF matrices ☆83 · Updated 2 years ago
- Self-training with Weak Supervision (NAACL 2021) ☆160 · Updated last year
- Check if you have training samples in your test set ☆64 · Updated 2 years ago
- Shared code for training sentence embeddings with Flax / JAX ☆27 · Updated 3 years ago
- ☆137 · Updated last year