soaxelbrooke / python-bpe
Byte Pair Encoding for Python!
☆227Updated 2 years ago
Alternatives and similar repositories for python-bpe:
Users that are interested in python-bpe are comparing it to the libraries listed below
- Fast BPE☆659Updated 7 months ago
- Neural Text Generation with Unlikelihood Training☆309Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆133Updated last year
- ☆319Updated 2 years ago
- A tool for holistic analysis of language generations systems☆467Updated 2 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆243Updated 3 years ago
- Python port of Moses tokenizer, truecaser and normalizer☆490Updated 8 months ago
- Unsupervised Statistical Machine Translation☆229Updated 4 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆226Updated 4 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- A Corpus for Multilingual Document Classification in Eight Languages.☆151Updated 2 years ago
- Pytorch implementation of "A Probabilistic Formulation of Unsupervised Text Style Transfer" by He. et. al. at ICLR 2020☆163Updated 2 years ago
- Calculating ROUGE score between two files (line-by-line)☆192Updated 3 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆325Updated last year
- Implementation of NeurIPS 19 paper: Paraphrase Generation with Latent Bag of Words☆122Updated 3 years ago
- ☆316Updated 3 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆184Updated last year
- Full Python implementation of the ROUGE metric, producing same results as in the official perl implementation.☆157Updated 5 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆340Updated 2 years ago
- Easily fine tune GPT-2 to fill in missing text☆196Updated 2 years ago
- ☆360Updated 2 years ago
- ICLR 2018 Quick-Thought vectors☆204Updated 5 years ago
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆250Updated 6 years ago
- Code to reproduce the experiments from the paper.☆101Updated last year
- A list of resources about Text Style Transfer☆59Updated 4 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆208Updated 2 years ago
- Neural models and instructions on how to reproduce our results for our neural grammatical error correction systems from M. Junczys-Dowmun…☆88Updated 5 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆97Updated 4 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 3 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆123Updated 7 years ago