ameet-1997 / AttentionGuidance
Guiding Attention for Self-Supervised Learning with Transformers
☆12 · Updated 2 years ago
Alternatives and similar repositories for AttentionGuidance
Users who are interested in AttentionGuidance are comparing it to the repositories listed below:
- Language Model Baselines for PyTorch ☆41 · Updated 5 years ago
- code for paper "Improving Sequence-to-Sequence Learning via Optimal Transport" ☆68 · Updated 6 years ago
- Code for the paper "A Fully Hyperbolic Neural Model for Hierarchical Multi-class Classification" ☆17 · Updated 4 years ago
- This code repository presents the pytorch implementation of the paper “Implicit Deep Latent Variable Models for Text Generation”(EMNLP 20… ☆55 · Updated 3 years ago
- "Predict, then Interpolate: A Simple Algorithm to Learn Stable Classifiers" ICML 2021 ☆18 · Updated 4 years ago
- statnlp-neural ☆32 · Updated 5 years ago
- This repository contains the code used for Ordered Memory paper ☆30 · Updated 5 years ago
- [EMNLP 2020] Data and PyTorch code of ConjNLI: Natural Language Inference over Conjunctive Sentences ☆11 · Updated 4 years ago
- Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data ☆36 · Updated 4 years ago
- A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance. ☆85 · Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021) ☆12 · Updated 3 years ago
- Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies" ☆16 · Updated 4 years ago
- ☆62 · Updated 3 years ago
- Layerwise Relevance Visualization in Convolutional Text Graph Classifiers ☆12 · Updated 4 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization ☆12 · Updated 3 years ago
- Official code for the ICLR 2020 paper 'ARE PRE-TRAINED LANGUAGE MODELS AWARE OF PHRASES? SIMPLE BUT STRONG BASELINES FOR GRAMMAR INDUCTIO… ☆30 · Updated 2 years ago
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020) ☆43 · Updated 2 years ago
- PyTorch implementation of A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text (EMNLP 2019) ☆48 · Updated 5 years ago
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation ☆25 · Updated 4 years ago
- ☆12 · Updated 6 years ago
- Deep Weighted Averaging Classifiers ☆23 · Updated 6 years ago
- ☆45 · Updated 3 years ago
- ☆22 · Updated 4 years ago
- Code for "Variational Sequential Labelers for Semi-Supervised Learning" (EMNLP 2018) ☆34 · Updated 6 years ago
- Tensorflow implementation of Invariant Rationalization ☆49 · Updated 2 years ago
- Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021) ☆27 · Updated 3 years ago
- ☆24 · Updated 3 months ago
- This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length" ☆71 · Updated last year
- Official Code for Towards Transparent and Explainable Attention Models paper (ACL 2020) ☆35 · Updated 3 years ago
- The data and code for NumerSense (EMNLP2020) ☆19 · Updated 2 years ago