ChunyuanLI / Optimus
Optimus: the first large-scale pre-trained VAE language model
☆376Updated last year
Related projects
Alternatives and complementary repositories for Optimus
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆241Updated 3 years ago
- Generative Flow based Sequence-to-Sequence Toolkit written in Python.☆244Updated 4 years ago
- Neural Text Generation with Unlikelihood Training☆310Updated 3 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆208Updated 2 years ago
- ☆175Updated 2 years ago
- PyTorch implementation of "A Probabilistic Formulation of Unsupervised Text Style Transfer" by He et al. at ICLR 2020☆163Updated 2 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.)☆305Updated last year
- Transformer-Based Conditioned Variational Autoencoder for Story Completion☆94Updated 4 years ago
- Implementation of the paper Tree Transformer☆210Updated 4 years ago
- ☆454Updated 3 years ago
- Easily fine tune GPT-2 to fill in missing text☆197Updated last year
- DGMs for NLP. A roadmap.☆392Updated last year
- Official code and data repository for our EMNLP 2020 long paper "Reformulating Unsupervised Style Transfer as Paraphrase Generation" (htt…☆229Updated 2 years ago
- The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"☆309Updated last year
- Transformer-based Conditional Variational Autoencoder for Controllable Story Generation☆146Updated 2 years ago
- Neural network parametrized objective to disentangle and transfer style and content in text☆139Updated 5 years ago
- Code for "Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation" (NeurIPS 2019)☆128Updated 5 years ago
- ☆315Updated 3 years ago
- This repo collects the articles for text attribute transfer☆243Updated 3 years ago
- Understanding the Difficulty of Training Transformers☆328Updated 2 years ago
- Codebase for testing whether hidden states of neural networks encode discrete structures.☆383Updated 8 months ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!)☆325Updated 10 months ago
- ☆204Updated 7 months ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆250Updated 3 years ago
- ☆212Updated 4 years ago
- An implementation of masked language modeling for PyTorch, made as concise and simple as possible☆177Updated last year
- Code accompanying our papers on the "Generative Distributional Control" framework☆117Updated last year
- ☆320Updated last year
- MPNet: Masked and Permuted Pre-training for Language Understanding https://arxiv.org/pdf/2004.09297.pdf☆288Updated 3 years ago
- PyTorch implementation of beam search decoding for seq2seq models☆336Updated last year