rsennrich / subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
☆2,219Updated 6 months ago
Alternatives and similar repositories for subword-nmt:
Users that are interested in subword-nmt are comparing it to the libraries listed below
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,194Updated 4 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,899Updated 2 years ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,101Updated last month
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,118Updated 2 years ago
- Open-Source Neural Machine Translation in Tensorflow☆795Updated 2 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,618Updated last year
- A python tool for evaluating the quality of sentence embeddings.☆2,094Updated 11 months ago
- Pre-trained ELMo Representations for Many Languages☆1,462Updated 3 years ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,892Updated 2 years ago
- Moses, the machine translation system☆1,591Updated 2 weeks ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,384Updated 3 years ago
- Fast BPE☆662Updated 8 months ago
- Simple, fast unsupervised word aligner☆744Updated 2 years ago
- A library for Multilingual Unsupervised or Supervised word Embeddings☆3,204Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,347Updated 10 months ago
- BERT-related papers☆2,037Updated last year
- An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group☆705Updated 2 years ago
- InferSent sentence embeddings☆2,285Updated 3 years ago
- KenLM: Faster and Smaller Language Model Queries☆2,557Updated 6 months ago
- Phrase-Based & Neural Unsupervised Machine Translation☆1,503Updated 3 years ago
- Basic Utilities for PyTorch Natural Language Processing (NLP)☆2,214Updated last year
- Neural machine translation and sequence learning using TensorFlow☆1,460Updated last year
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,182Updated last year
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,245Updated 11 months ago
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"☆1,413Updated last year
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…☆1,533Updated last year
- Code for paper Fine-tune BERT for Extractive Summarization☆1,475Updated 3 years ago
- Language-Agnostic SEntence Representations☆3,617Updated 9 months ago
- ☆3,635Updated 2 years ago
- An open source framework for seq2seq models in PyTorch.☆1,506Updated 2 years ago