MirunaPislar / multi-head-attention-labeller
Joint text classification on multiple levels with multiple labels, using a multi-head attention mechanism to wire two prediction tasks together.
☆16Updated 4 years ago
Alternatives and similar repositories for multi-head-attention-labeller:
Users that are interested in multi-head-attention-labeller are comparing it to the libraries listed below
- Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020☆13Updated 8 months ago
- ☆17Updated 2 years ago
- Official demo repository for our ACL 2019 long paper "Generating Question-Answer Hierarchies".☆19Updated 4 years ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"☆29Updated 4 years ago
- ☆32Updated 3 years ago
- This is an example program illustrating BERTs masked language model.☆28Updated 4 years ago
- Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER☆9Updated 5 years ago
- A Cross-Domain Transferable Neural Coherence Model https://arxiv.org/abs/1905.11912☆24Updated 4 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆19Updated 2 years ago
- Interpretable Models for NLP using PyTorch☆18Updated 7 years ago
- The code for Template-GPT-2 Generation Model for Logic2Text Dataset☆18Updated 4 years ago
- A visualizer to display attention weights on text☆23Updated 6 years ago
- Code and dataset for "Transfer Learning Between Related Tasks Using Expected Label Proportions"☆16Updated 5 years ago
- Leveraging Local and Global Patterns for Self-Attention Networks☆12Updated 5 years ago
- Code for ICLR 2019 paper 'CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model'☆21Updated 5 years ago
- A Python implementation of a graph-based parser for Abstract Meaning Representation (AMR)☆11Updated 7 years ago
- Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"☆32Updated 4 years ago
- ☆9Updated 5 years ago
- Neural (LSTM) version of the partial CRF model☆35Updated 5 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Updated 2 years ago
- CapsNet for NLP☆67Updated 6 years ago
- Combining encoder-based language models☆11Updated 3 years ago
- ☆9Updated 6 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch☆45Updated 4 years ago
- Code for "Smaller Text Classifiers with Discriminative Cluster Embeddings" (NAACL 2018)☆29Updated 6 years ago
- Source code for "A Lightweight Recurrent Network for Sequence Modeling"☆26Updated 2 years ago
- Augmentation scripts for the bAbI Dialog Tasks dataset☆13Updated 6 years ago
- The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019☆29Updated 2 years ago
- Adaptive embedding and softmax☆17Updated 3 years ago
- Natural Language Generation by Hierarchical Decoding with Linguistic Patterns (NAACL-HLT 2018), Investigating Linguistic Pattern Ordering…☆32Updated 6 years ago