prajjwal1 / adaptive_transformerLinks
Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)
☆43Updated 2 years ago
Alternatives and similar repositories for adaptive_transformer
Users that are interested in adaptive_transformer are comparing it to the libraries listed below
Sorting:
- Code for EMNLP 2020 paper CoDIR☆41Updated 3 years ago
- An Unsupervised Sentence Embedding Method by Mutual Information Maximization (EMNLP2020)☆61Updated 4 years ago
- Domain Adaptation using External Knowledge for Sentiment Analysis☆53Updated 2 years ago
- Source code for ICLR 2021 paper : Pre-training Text-to-Text Transformers for Concept-Centric Common Sense☆27Updated 4 years ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling☆72Updated 2 years ago
- Implementation for paper " Unsupervised Domain Adaptation on Reading Comprehension "☆30Updated 5 years ago
- code for paper "Improving Sequence-to-Sequence Learning via Optimal Transport"☆68Updated 6 years ago
- Neural Machine Translation with universal Visual Representation (ICLR 2020)☆90Updated 5 years ago
- The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering".☆71Updated last year
- Densely Connected Graph Convolutional Networks for Graph-to-Sequence Learning (authors' MXNet implementation for the TACL19 paper)☆78Updated 4 years ago
- [ACL‘20] Highway Transformer: A Gated Transformer.☆32Updated 3 years ago
- ☆36Updated 5 years ago
- Official Code for Towards Transparent and Explainable Attention Models paper (ACL 2020)☆35Updated 3 years ago
- Official code for the paper "PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains".☆51Updated 3 years ago
- Source Code for paper "NERO: A Neural Rule Grounding Framework for Label-Efficient Relation Extraction", WWW 2020☆47Updated 5 years ago
- Dialogue Relation Extraction with Document-level Heterogeneous Graph Attention Networks☆56Updated 2 years ago
- ☆44Updated 6 years ago
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".☆131Updated 4 years ago
- Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction☆20Updated 3 years ago
- ☆50Updated 2 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Updated 4 years ago
- Lite Self-Training☆29Updated 2 years ago
- Codes for the paper: "Continual Learning for Text Classification with Information Disentanglement Based Regularization"☆45Updated 2 years ago
- ☆84Updated 5 years ago
- Code for our EMNLP 2019 paper titled "Sentence-Level Content Planning and Style Specification for Neural Text Generation"☆17Updated 5 years ago
- Unicoder model for understanding and generation.☆91Updated last year
- Code for our NAACL-2021 paper "Disentangling Semantics and Syntax in Sentence Embeddings with Pre-trained Language Models".☆23Updated 3 years ago
- Code for the paper "True Few-Shot Learning in Language Models" (https://arxiv.org/abs/2105.11447)☆145Updated 3 years ago
- How Does Selective Mechanism Improve Self-attention Networks?☆29Updated 4 years ago
- statnlp-neural☆32Updated 6 years ago