soaxelbrooke / python-bpeLinks
Byte Pair Encoding for Python!
☆231Updated 3 years ago
Alternatives and similar repositories for python-bpe
Users that are interested in python-bpe are comparing it to the libraries listed below
Sorting:
- Fast BPE☆678Updated last year
- ☆324Updated 2 years ago
- Unsupervised Statistical Machine Translation☆229Updated 5 years ago
- Neural Text Generation with Unlikelihood Training☆310Updated 4 years ago
- Implementation of a linear-chain CRF in PyTorch☆97Updated 4 years ago
- New dataset☆308Updated 4 years ago
- Python port of Moses tokenizer, truecaser and normalizer☆495Updated last year
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆231Updated 4 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆125Updated 8 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆347Updated 2 years ago
- A tool for holistic analysis of language generations systems☆472Updated last month
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆250Updated 7 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆208Updated 4 months ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆330Updated last year
- 📃Language Model based sentences scoring library☆309Updated 3 years ago
- Unsupervised Question answering via Cloze Translation☆219Updated 3 years ago
- eXtensible Neural Machine Translation☆185Updated last month
- LASER multilingual sentence embeddings as a pip package☆225Updated 2 years ago
- A framework to learn cross-lingual word embedding mappings☆649Updated 2 years ago
- Python code for various NLP metrics☆168Updated 6 years ago
- Pytorch Implementation of ALBERT(A Lite BERT for Self-supervised Learning of Language Representations)☆227Updated 4 years ago
- A Corpus for Multilingual Document Classification in Eight Languages.☆152Updated 3 years ago
- Evaluating Cross-lingual Sentence Representations☆458Updated 4 years ago
- A Python wrapper for the ROUGE summarization evaluation package☆249Updated 4 years ago
- Easily fine tune GPT-2 to fill in missing text☆201Updated 2 years ago
- Minimal tutorial on packing and unpacking sequences in pytorch☆210Updated 6 years ago
- Open-Source Machine Translation Quality Estimation in PyTorch☆231Updated 3 years ago
- Character-aware Neural Language Model implemented by PyTorch☆35Updated 7 years ago
- Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018☆123Updated last month
- Full Python implementation of the ROUGE metric, producing same results as in the official perl implementation.☆160Updated 6 years ago