rsennrich / subword-nmtLinks
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
☆2,264Updated last year
Alternatives and similar repositories for subword-nmt
Users that are interested in subword-nmt are comparing it to the libraries listed below
Sorting:
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,923Updated 2 years ago
- Moses, the machine translation system☆1,620Updated 9 months ago
- A python tool for evaluating the quality of sentence embeddings.☆2,107Updated last year
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,122Updated 3 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,218Updated last year
- Open-Source Neural Machine Translation in Tensorflow☆805Updated 3 years ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,220Updated last week
- Neural machine translation and sequence learning using TensorFlow☆1,487Updated 2 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,613Updated 2 years ago
- KenLM: Faster and Smaller Language Model Queries☆2,716Updated 9 months ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,368Updated last year
- Pre-trained ELMo Representations for Many Languages☆1,462Updated 4 years ago
- ☆3,679Updated 3 years ago
- A library for Multilingual Unsupervised or Supervised word Embeddings☆3,236Updated 3 years ago
- Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆2,193Updated 3 years ago
- 🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI☆1,524Updated 4 years ago
- Fast BPE☆678Updated last year
- InferSent sentence embeddings☆2,279Updated 4 years ago
- Phrase-Based & Neural Unsupervised Machine Translation☆1,504Updated 4 years ago
- An open source framework for seq2seq models in PyTorch.☆1,515Updated 4 months ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,258Updated last year
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,897Updated 3 years ago
- A machine translation reading list maintained by Tsinghua Natural Language Processing Group☆2,440Updated last year
- Simple, fast unsupervised word aligner☆765Updated 3 years ago
- An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group☆707Updated 3 years ago
- Language-Agnostic SEntence Representations☆3,660Updated last year
- Code for paper Fine-tune BERT for Extractive Summarization☆1,504Updated 4 years ago
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training"☆2,268Updated 6 years ago
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…☆1,540Updated 2 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,174Updated 2 years ago