prajjwal1 / adaptive_transformer
Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)
☆43 · Updated 3 years ago
Alternatives and similar repositories for adaptive_transformer
Users who are interested in adaptive_transformer are comparing it to the repositories listed below
- Code for EMNLP 2020 paper CoDIR ☆41 · Updated 3 years ago
- ☆84 · Updated 6 years ago
- An Unsupervised Sentence Embedding Method by Mutual Information Maximization (EMNLP 2020) ☆61 · Updated 4 years ago
- Domain Adaptation using External Knowledge for Sentiment Analysis ☆53 · Updated 2 years ago
- Code for our EMNLP 2019 paper titled "Sentence-Level Content Planning and Style Specification for Neural Text Generation" ☆17 · Updated 5 years ago
- Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction ☆20 · Updated 3 years ago
- Source code for ICLR 2021 paper: Pre-training Text-to-Text Transformers for Concept-Centric Common Sense ☆27 · Updated 4 years ago
- ☆44 · Updated 6 years ago
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation". ☆128 · Updated 4 years ago
- This repo provides the code for the ACL 2020 paper "Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEnco… ☆55 · Updated 4 years ago
- Source Code for paper "NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction", WWW 2020 ☆47 · Updated 5 years ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model ☆59 · Updated 5 years ago
- Official Code for Towards Transparent and Explainable Attention Models paper (ACL 2020) ☆34 · Updated 3 years ago
- Implementation for paper "Unsupervised Domain Adaptation on Reading Comprehension" ☆30 · Updated 5 years ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling ☆72 · Updated 2 years ago
- Dialogue Relation Extraction with Document-level Heterogeneous Graph Attention Networks ☆56 · Updated 2 years ago
- Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020) ☆29 · Updated 4 years ago
- A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance. ☆86 · Updated 2 years ago
- [ACL'20] Highway Transformer: A Gated Transformer. ☆32 · Updated 3 years ago
- A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models" ☆73 · Updated 2 years ago
- ☆50 · Updated 2 years ago
- Code for paper "Improving Sequence-to-Sequence Learning via Optimal Transport" ☆68 · Updated 6 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever" ☆32 · Updated 4 years ago
- Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces ☆58 · Updated 4 years ago
- Code for the paper: "Continual Learning for Text Classification with Information Disentanglement Based Regularization" ☆44 · Updated 2 years ago
- Encoder-Agnostic Adaptation for Conditional Language Generation ☆80 · Updated last year
- How Does Selective Mechanism Improve Self-attention Networks? ☆29 · Updated 4 years ago
- ☆39 · Updated 6 years ago
- CausaLM: Causal Model Explanation Through Counterfactual Language Models ☆56 · Updated 5 years ago
- A PyTorch implementation of Google AI's BERT model provided with Google's pre-trained models, examples and utilities. ☆35 · Updated 6 years ago