microsoft / MT-DNNLinks
Multi-Task Deep Neural Networks for Natural Language Understanding
☆164Updated 2 years ago
Alternatives and similar repositories for MT-DNN
Users that are interested in MT-DNN are comparing it to the libraries listed below
Sorting:
- X-Transformer: Taming Pretrained Transformers for eXtreme Multi-label Text Classification☆138Updated 4 years ago
- A library to conduct ranking experiments with transformers.☆160Updated 2 years ago
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.☆72Updated 3 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆154Updated 3 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆117Updated last year
- Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data☆100Updated last year
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆184Updated 2 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆108Updated 4 years ago
- ICML'2022: NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework☆255Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 3 years ago
- [NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240☆168Updated 2 years ago
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆532Updated 3 years ago
- Few-shot Natural Language Generation for Task-Oriented Dialog☆189Updated 2 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models☆179Updated 4 years ago
- Code for paper "Hierarchical Text Classification with Reinforced Label Assignment" EMNLP 2019☆139Updated 3 years ago
- ☆96Updated 5 years ago
- Semantics-aware BERT for Language Understanding (AAAI 2020)☆288Updated 2 years ago
- SpanNER: Named EntityRe-/Recognition as Span Prediction☆131Updated 3 years ago
- [EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification☆128Updated 2 years ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"☆294Updated 2 years ago
- code and data to faciliate BERT/ELECTRA for document ranking. Details refer to the paper - PARADE: Passage Representation Aggregation for…☆97Updated 2 years ago
- ☆162Updated 5 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆132Updated 2 years ago
- Interpreting Bidirectional Encoder Representations from Transformers (BERT)☆110Updated 2 years ago
- A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks☆372Updated 2 years ago
- MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf☆294Updated 3 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆132Updated this week
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆136Updated last year
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆107Updated 2 years ago