soaxelbrooke / python-bpeLinks
Byte Pair Encoding for Python!
☆231Updated 3 years ago
Alternatives and similar repositories for python-bpe
Users that are interested in python-bpe are comparing it to the libraries listed below
Sorting:
- Fast BPE☆677Updated last year
- ☆324Updated 2 years ago
- Neural Text Generation with Unlikelihood Training☆309Updated 4 years ago
- Unsupervised Statistical Machine Translation☆228Updated 5 years ago
- Implementation of a linear-chain CRF in PyTorch☆97Updated 4 years ago
- A tool for holistic analysis of language generations systems☆472Updated last week
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆251Updated 7 years ago
- New dataset☆307Updated 4 years ago
- A Python wrapper for the ROUGE summarization evaluation package☆249Updated 4 years ago
- Python code for various NLP metrics☆168Updated 5 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆231Updated 4 years ago
- Easily fine tune GPT-2 to fill in missing text☆201Updated 2 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆125Updated 8 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆347Updated 2 years ago
- Full Python implementation of the ROUGE metric, producing same results as in the official perl implementation.☆160Updated 6 years ago
- eXtensible Neural Machine Translation☆187Updated last week
- 📃Language Model based sentences scoring library☆309Updated 3 years ago
- A framework to learn cross-lingual word embedding mappings☆647Updated 2 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆330Updated last year
- Minimalist implementation of a BERT Sentence Classifier with PyTorch Lightning, Transformers and PyTorch-NLP.☆219Updated 2 years ago
- Variational Methods for Pretraining in Resource-limited Environments☆174Updated 5 years ago
- Open-Source Machine Translation Quality Estimation in PyTorch☆232Updated 3 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆211Updated 3 months ago
- Implementation of NeurIPS 19 paper: Paraphrase Generation with Latent Bag of Words☆122Updated 3 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆561Updated 3 years ago
- ☆212Updated last year
- A Corpus for Multilingual Document Classification in Eight Languages.☆151Updated 3 years ago
- ICLR 2018 Quick-Thought vectors☆204Updated 6 years ago
- Code to reproduce the experiments from the paper.☆101Updated last year
- Python port of Moses tokenizer, truecaser and normalizer☆495Updated last year