microsoft / MT-DNN
Multi-Task Deep Neural Networks for Natural Language Understanding
☆166Updated 2 years ago
Alternatives and similar repositories for MT-DNN
Users that are interested in MT-DNN are comparing it to the libraries listed below
- PyTorch version of BERT-flow: one can apply BERT-flow to any PLM within the PyTorch framework.☆71Updated 4 years ago
- X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text Classification☆141Updated 4 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 5 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆152Updated 3 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆118Updated 2 years ago
- SpanNER: Named Entity Re-/Recognition as Span Prediction☆131Updated 3 years ago
- A library to conduct ranking experiments with transformers.☆160Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 4 years ago
- ☆99Updated 5 years ago
- Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data☆100Updated 2 years ago
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K…☆174Updated 2 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆253Updated 2 years ago
- [NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240☆168Updated 3 years ago
- Code for paper "Hierarchical Text Classification with Reinforced Label Assignment" EMNLP 2019☆137Updated 4 years ago
- [EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification☆130Updated 2 years ago
- Few-shot Natural Language Generation for Task-Oriented Dialog☆189Updated 3 years ago
- Code and data to facilitate BERT/ELECTRA for document ranking. For details, refer to the paper - PARADE: Passage Representation Aggregation for…☆96Updated 2 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models