microsoft / MASS
MASS: Masked Sequence to Sequence Pre-training for Language Generation
☆1,117Updated 2 years ago
Alternatives and similar repositories for MASS:
Users that are interested in MASS are comparing it to the libraries listed below
- Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CAS…☆745Updated 2 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,619Updated 2 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,903Updated 2 years ago
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,227Updated 7 months ago
- Pre-trained ELMo Representations for Many Languages☆1,461Updated 3 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,249Updated last year
- pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"☆910Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,350Updated last year
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"☆1,415Updated last year
- Code for paper Fine-tune BERT for Extractive Summarization☆1,481Updated 3 years ago
- jiant is an nlp toolkit☆1,663Updated last year
- Evaluating Cross-lingual Sentence Representations☆450Updated 3 years ago
- Fast BPE☆668Updated 9 months ago
- Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆2,189Updated 2 years ago
- Simple XLNet implementation with Pytorch Wrapper☆582Updated 5 years ago
- A Tensorflow implementation of QANet for machine reading comprehension☆981Updated 6 years ago
- BERT for Multitask Learning☆546Updated last year
- ☆604Updated 4 months ago
- Code for using and evaluating SpanBERT.☆896Updated last year
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.☆1,246Updated 2 years ago
- BERT-related papers☆2,041Updated last year
- A python tool for evaluating the quality of sentence embeddings.☆2,100Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,205Updated 5 months ago
- Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN☆962Updated 6 years ago
- code for EMNLP 2019 paper Text Summarization with Pretrained Encoders☆1,291Updated 8 months ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,895Updated 2 years ago
- Empower Sequence Labeling with Task-Aware Language Model☆846Updated 2 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆640Updated 2 years ago
- Evaluation code for various unsupervised automated metrics for Natural Language Generation.☆1,371Updated 7 months ago
- ☆362Updated 2 years ago