huanghonggit / Mask-Language-ModelLinks

pytorch； mask language model ； bert

☆72

Alternatives and similar repositories for Mask-Language-Model

Users that are interested in Mask-Language-Model are comparing it to the libraries listed below

Sorting:

budzianowski / PyTorch-Beam-Search-Decoding
PyTorch implementation of beam search decoding for seq2seq models
☆337Updated 2 years ago
shentianxiao / text-autoencoders
☆209Updated last year
facebookresearch / Mask-Predict
A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…
☆244Updated 3 years ago
lucidrains / electra-pytorch
A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch
☆227Updated 2 years ago
guolinke / TUPE
Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…
☆251Updated 3 years ago
312shan / Pytorch-seq2seq-Beam-Search
PyTorch implementation for Seq2Seq model with attention and Greedy Search / Beam Search for neural machine translation
☆58Updated 4 years ago
andrewpeng02 / transformer-translation
Using Pytorch's nn.Transformer module to create an english to french neural machine translation model.
☆78Updated 5 years ago
fangleai / TransformerCVAE
Transformer-based Conditional Variational Autoencoder for Controllable Story Generation
☆155Updated 3 years ago
ChunyuanLI / Optimus
Optimus: the first large-scale pre-trained VAE language model
☆390Updated last year
yanzhangnlp / IS-BERT
An Unsupervised Sentence Embedding Method by Mutual Information Maximization (EMNLP2020)
☆61Updated 4 years ago
lucidrains / mlm-pytorch
An implementation of masked language modeling for Pytorch, made as concise and simple as possible
☆179Updated last year
fastnlp / style-transformer
☆177Updated 3 years ago
jiesutd / Text-Attention-Heatmap-Visualization
Plot the vector graph of attention based text visualisation
☆371Updated 6 years ago
asappresearch / revisit-bert-finetuning
For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).
☆184Updated 2 years ago
dreamgonfly / BERT-pytorch
PyTorch implementation of BERT in "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
☆107Updated 6 years ago
laiguokun / Funnel-Transformer
☆218Updated 5 years ago
FreedomIntelligence / complex-order
☆83Updated 5 years ago
voidism / DiffCSE
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
☆295Updated 2 years ago
akashkm99 / Interpretable-Attention
Official Code for Towards Transparent and Explainable Attention Models paper (ACL 2020)
☆35Updated 3 years ago
lonePatient / electra_pytorch
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆91Updated 3 years ago
Fraser-Greenlee / T5-VAE
Checkout the new version at the link!
☆22Updated 4 years ago
wenxianxian / demvae
Dispersed Exponential Family Mixture VAE
☆28Updated 5 years ago
richarddwang / electra_pytorch
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)
☆330Updated last year
seongminp / transformers-into-vaes
Code for "Finetuning Pretrained Transformers into Variational Autoencoders"
☆39Updated 3 years ago
mailong25 / meta-learning-bert
Meta learning with BERT as a learner
☆109Updated last year
kahne / NonAutoregGenProgress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
☆306Updated 2 years ago
ahmetumutdurmus / awd-lstm
Replication of "Regularizing and Optimizing LSTM Language Models" by Merity et al. (2017).
☆12Updated 5 years ago
XuezheMax / flowseq
Generative Flow based Sequence-to-Sequence Toolkit written in Python.
☆245Updated 5 years ago
chho33 / LAMOL
Code for LAMOL: LAnguage MOdeling for Lifelong Language Learning
☆95Updated 4 years ago
dinghanshen / Cutoff
The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach …
☆63Updated 4 years ago