soaxelbrooke / python-bpeLinks
Byte Pair Encoding for Python!
☆231Updated 2 years ago
Alternatives and similar repositories for python-bpe
Users that are interested in python-bpe are comparing it to the libraries listed below
Sorting:
- ☆323Updated 2 years ago
- Fast BPE☆672Updated last year
- Unsupervised Statistical Machine Translation☆229Updated 4 years ago
- Neural Text Generation with Unlikelihood Training☆309Updated 3 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆123Updated 8 years ago
- New dataset☆306Updated 3 years ago
- Implementation of a linear-chain CRF in PyTorch☆97Updated 4 years ago
- Unsupervised Question answering via Cloze Translation☆219Updated 3 years ago
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆251Updated 7 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆230Updated 4 years ago
- 📃Language Model based sentences scoring library☆309Updated 3 years ago
- Code to reproduce the experiments from the paper.☆101Updated last year
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆330Updated last year
- A Corpus for Multilingual Document Classification in Eight Languages.☆151Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆137Updated last year
- Python code for various NLP metrics☆168Updated 5 years ago
- A Python wrapper for the ROUGE summarization evaluation package☆249Updated 4 years ago
- A tool for holistic analysis of language generations systems☆471Updated 3 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆344Updated 2 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆561Updated 3 years ago
- Implementation of NeurIPS 19 paper: Paraphrase Generation with Latent Bag of Words☆122Updated 3 years ago
- LM Pretraining with PyTorch/TPU☆135Updated 5 years ago
- Character-aware Neural Language Model implemented by PyTorch☆35Updated 7 years ago
- Python port of Moses tokenizer, truecaser and normalizer☆497Updated last year
- ☆211Updated last year
- Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018☆124Updated 5 years ago
- Neural models and instructions on how to reproduce our results for our neural grammatical error correction systems from M. Junczys-Dowmun…☆88Updated 6 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆211Updated 2 months ago
- Open-Source Machine Translation Quality Estimation in PyTorch☆232Updated 3 years ago
- JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation☆113Updated 2 years ago