swabhs / notebooks_for_aflite
IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".
☆16 Updated 4 years ago
Alternatives and similar repositories for notebooks_for_aflite:
Users who are interested in notebooks_for_aflite are comparing it to the repositories listed below
- ☆24 Updated 3 years ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching" ☆19 Updated 2 years ago
- [EMNLP 2020] "T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack" by Boxin Wang, Hengzhi Pei, Boyuan Pan, Q… ☆26 Updated 3 years ago
- OOD Generalization and Detection (ACL 2020) ☆60 Updated 4 years ago
- Code for "Interpretable Image Recognition with Hierarchical Prototypes" ☆18 Updated 5 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data" ☆14 Updated 3 years ago
- Explaining neural decisions contrastively to alternative decisions. ☆23 Updated 3 years ago
- Code for "Imitation Attacks and Defenses for Black-box Machine Translation Systems" ☆36 Updated 4 years ago
- Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020) ☆15 Updated 4 years ago
- ☆86 Updated last year
- EMNLP BlackBox NLP 2020: Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples ☆23 Updated 4 years ago
- This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length" ☆69 Updated 4 months ago
- Group-conditional DRO to alleviate spurious correlations ☆15 Updated 3 years ago
- Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data ☆35 Updated 4 years ago
- ☆9 Updated 4 years ago
- In-context Example Selection with Influences ☆15 Updated last year
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation" ☆22 Updated 3 years ago
- ☆38 Updated 3 years ago
- Code for "Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior?" ☆44 Updated last year
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22) ☆19 Updated 2 years ago
- ☆63 Updated 2 years ago
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers" ☆17 Updated 4 years ago
- Code for the EMNLP 2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123 ☆12 Updated 3 years ago
- Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals" ☆17 Updated 2 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data ☆36 Updated 2 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems" ☆19 Updated last year
- Data and code repository of "Multilingual Fairness Evaluation for Hate Speech Detection". LREC 2020. ☆20 Updated 2 years ago
- Dataset + classifier tools to study social perception biases in natural language generation ☆67 Updated last year
- ☆15 Updated 3 years ago