microsoft/MASS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/MASS)

microsoft / MASS

MASS: Masked Sequence to Sequence Pre-training for Language Generation

☆1,117

Alternatives and similar repositories for MASS

Users that are interested in MASS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / XLM
View on GitHub
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,927Feb 14, 2023Updated 3 years ago
zihangdai / xlnet
View on GitHub
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,180May 28, 2023Updated 3 years ago
facebookresearch / UnsupervisedMT
View on GitHub
Phrase-Based & Neural Unsupervised Machine Translation
☆1,499Sep 15, 2021Updated 4 years ago
rsennrich / subword-nmt
View on GitHub
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
☆2,271Aug 7, 2024Updated last year
namisan / mt-dnn
View on GitHub
Multi-Task Deep Neural Networks for Natural Language Understanding
☆2,260Mar 7, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OpenNMT / OpenNMT-py
View on GitHub
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
☆7,007Oct 14, 2025Updated 9 months ago
glample / fastBPE
View on GitHub
Fast BPE
☆677Jun 18, 2024Updated 2 years ago
google-research / text-to-text-transfer-transformer
View on GitHub
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,539Jul 8, 2026Updated last week
abisee / pointer-generator
View on GitHub
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
☆2,194Jun 16, 2022Updated 4 years ago
facebookresearch / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,244Sep 30, 2025Updated 9 months ago
nlpyang / BertSum
View on GitHub
Code for paper Fine-tune BERT for Extractive Summarization
☆1,507Jan 11, 2022Updated 4 years ago
THUNLP-MT / MT-Reading-List
View on GitHub
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
☆2,437Aug 9, 2024Updated last year
thunlp / ERNIE
View on GitHub
Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"
☆1,419Jan 10, 2024Updated 2 years ago
google-research / electra
View on GitHub
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆2,368Mar 23, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bert-nmt / bert-nmt
View on GitHub
☆361Nov 22, 2022Updated 3 years ago
allenai / allennlp
View on GitHub
An open-source NLP research library, built on PyTorch.
☆11,889Nov 22, 2022Updated 3 years ago
facebookresearch / unlikelihood_training
View on GitHub
Neural Text Generation with Unlikelihood Training
☆311Aug 31, 2021Updated 4 years ago
nlpyang / PreSumm
View on GitHub
code for EMNLP 2019 paper Text Summarization with Pretrained Encoders
☆1,303Jul 25, 2024Updated last year
nyu-dl / bert-gen
View on GitHub
☆323Dec 16, 2022Updated 3 years ago
clab / fast_align
View on GitHub
Simple, fast unsupervised word aligner
☆769Jul 19, 2022Updated 4 years ago
microsoft / MPNet
View on GitHub
MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf
☆298Sep 11, 2021Updated 4 years ago
facebookresearch / MUSE
View on GitHub
A library for Multilingual Unsupervised or Supervised word Embeddings
☆3,248Aug 31, 2022Updated 3 years ago
allenai / bilm-tf
View on GitHub
Tensorflow implementation of contextualized word representations from bi-directional language models
☆1,612Mar 4, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookresearch / Mask-Predict
View on GitHub
A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…
☆247Sep 17, 2021Updated 4 years ago
moses-smt / mosesdecoder
View on GitHub
Moses, the machine translation system
☆1,625Mar 28, 2025Updated last year
brightmart / albert_zh
View on GitHub
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
☆3,979Nov 21, 2022Updated 3 years ago
asyml / texar-pytorch
View on GitHub
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CAS…
☆747Apr 14, 2022Updated 4 years ago
huawei-noah / Pretrained-Language-Model
View on GitHub
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,162Jan 22, 2024Updated 2 years ago
asyml / texar
View on GitHub
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…
☆2,390Aug 26, 2021Updated 4 years ago
thunlp / PLMpapers
View on GitHub
Must-read Papers on pre-trained language models.
☆3,361Nov 6, 2022Updated 3 years ago
kimiyoung / transformer-xl
View on GitHub
☆3,707Sep 21, 2022Updated 3 years ago
neulab / compare-mt
View on GitHub
A tool for holistic analysis of language generations systems
☆471Sep 22, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookresearch / LAMA
View on GitHub
LAnguage Model Analysis
☆1,391Jul 7, 2024Updated 2 years ago
google / sentencepiece
View on GitHub
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,969Updated this week
salesforce / ctrl
View on GitHub
Conditional Transformer Language Model for Controllable Generation
☆1,881May 1, 2025Updated last year
yaserkl / RLSeq2Seq
View on GitHub
Deep Reinforcement Learning For Sequence to Sequence Models
☆767Mar 24, 2023Updated 3 years ago
jina-ai / clip-as-service
View on GitHub
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
☆12,829Jan 23, 2024Updated 2 years ago
google-research / bert
View on GitHub
TensorFlow code and pre-trained models for BERT
☆40,059Jul 23, 2024Updated last year
google-research / xtreme
View on GitHub
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…
☆651Jan 4, 2023Updated 3 years ago