microsoft / MASS
MASS: Masked Sequence to Sequence Pre-training for Language Generation
☆1,118Updated last year
Related projects ⓘ
Alternatives and complementary repositories for MASS
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"☆1,412Updated 10 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,892Updated last year
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,620Updated last year
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,197Updated 3 months ago
- Pre-trained ELMo Representations for Many Languages☆1,463Updated 3 years ago
- Code for using and evaluating SpanBERT.☆891Updated last year
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,239Updated 8 months ago
- Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CAS…☆745Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,339Updated 7 months ago
- Simple XLNet implementation with Pytorch Wrapper☆577Updated 5 years ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,890Updated 2 years ago
- Code for paper Fine-tune BERT for Extractive Summarization☆1,468Updated 2 years ago
- pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"☆907Updated last year
- ☆360Updated last year
- Evaluating Cross-lingual Sentence Representations☆442Updated 3 years ago
- Pytorch-Named-Entity-Recognition-with-BERT☆1,211Updated 3 years ago
- code for EMNLP 2019 paper Text Summarization with Pretrained Encoders☆1,285Updated 3 months ago
- Fast BPE☆656Updated 5 months ago
- A python tool for evaluating the quality of sentence embeddings.☆2,087Updated 8 months ago
- ☆604Updated 3 weeks ago
- Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).☆1,246Updated 2 years ago
- Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆2,179Updated 2 years ago
- 🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI☆1,511Updated 3 years ago
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…☆1,535Updated last year
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,132Updated 9 months ago
- Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN☆959Updated 5 years ago
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.☆1,243Updated 2 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,389Updated 3 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,187Updated last month
- BERT for Multitask Learning☆546Updated last year