MirunaPislar / multi-head-attention-labeller

Joint text classification on multiple levels with multiple labels, using a multi-head attention mechanism to wire two prediction tasks together.

☆16

Alternatives and similar repositories for multi-head-attention-labeller:

Users that are interested in multi-head-attention-labeller are comparing it to the libraries listed below

btaille / contener
Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020
☆13Updated 8 months ago
IIEKES / MLM_transfer
☆17Updated 2 years ago
martiansideofthemoon / squash-website
Official demo repository for our ACL 2019 long paper "Generating Question-Answer Hierarchies".
☆19Updated 4 years ago
MurtyShikhar / ExpBERT
Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"
☆29Updated 4 years ago
zhangyi24 / sentence_transformer_zh
☆32Updated 3 years ago
ajitrajasekharan / bert_mask
This is an example program illustrating BERTs masked language model.
☆28Updated 4 years ago
jacobvsdanniel / cross-ner
Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER
☆9Updated 5 years ago
BorealisAI / cross_domain_coherence
A Cross-Domain Transferable Neural Coherence Model https://arxiv.org/abs/1905.11912
☆24Updated 4 years ago
MiuLab / Lattice-ELMo
Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"
☆19Updated 2 years ago
Sandeep42 / anuvada
Interpretable Models for NLP using PyTorch
☆18Updated 7 years ago
wenhuchen / GPT2-Logic2Text
The code for Template-GPT-2 Generation Model for Logic2Text Dataset
☆18Updated 4 years ago
shashwattrivedi / Attention_visualizer
A visualizer to display attention weights on text
☆23Updated 6 years ago
MatanBN / XRTransfer
Code and dataset for "Transfer Learning Between Related Tasks Using Expected Label Proportions"
☆16Updated 5 years ago
scewiner / Leveraging
Leveraging Local and Global Patterns for Self-Attention Networks
☆12Updated 5 years ago
florianmai / word2mat
Code for ICLR 2019 paper 'CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model'
☆21Updated 5 years ago
ypuzikov / Graph-based-AMR-Parser
A Python implementation of a graph-based parser for Abstract Meaning Representation (AMR)
☆11Updated 7 years ago
lucidrains / distilled-retriever-pytorch
Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"
☆32Updated 4 years ago
IIEKES / cbert_aug_deprecated
☆9Updated 5 years ago
allanj / neural-partialCRF
Neural (LSTM) version of the partial CRF model
☆35Updated 5 years ago
CLUEbenchmark / LGEB
LGEB: Benchmark of Language Generation Evaluation
☆16Updated 2 years ago
stefan-it / capsnet-nlp
CapsNet for NLP
☆67Updated 6 years ago
lukasgarbas / multiencoder
Combining encoder-based language models
☆11Updated 3 years ago
jiesutd / PyTorchSequence
☆9Updated 6 years ago
lucidrains / coco-lm-pytorch
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆45Updated 4 years ago
mingdachen / word-cluster-embedding
Code for "Smaller Text Classifiers with Discriminative Cluster Embeddings" (NAACL 2018)
☆29Updated 6 years ago
bzhangGo / lrn
Source code for "A Lightweight Recurrent Network for Sequence Modeling"
☆26Updated 2 years ago
ishalyminov / babi_tools
Augmentation scripts for the bAbI Dialog Tasks dataset
☆13Updated 6 years ago
text-machine-lab / adversarial_decomposition
The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019
☆29Updated 2 years ago
CyberZHG / keras-adaptive-softmax
Adaptive embedding and softmax
☆17Updated 3 years ago
MiuLab / HNLG
Natural Language Generation by Hierarchical Decoding with Linguistic Patterns (NAACL-HLT 2018), Investigating Linguistic Pattern Ordering…
☆32Updated 6 years ago