kpu / kenlm
KenLM: Faster and Smaller Language Model Queries
☆2,600Updated last month
Alternatives and similar repositories for kenlm:
Users that are interested in kenlm are comparing it to the libraries listed below
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,232Updated 9 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,905Updated 2 years ago
- Neural machine translation and sequence learning using TensorFlow☆1,468Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,213Updated 7 months ago
- Simple, fast unsupervised word aligner☆752Updated 2 years ago
- A python tool for evaluating the quality of sentence embeddings.☆2,106Updated last year
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,619Updated 2 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.☆1,863Updated 2 years ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,119Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,353Updated last year
- PyTorch CTC Decoder bindings☆835Updated last year
- Pre-trained ELMo Representations for Many Languages☆1,461Updated 3 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,385Updated 3 years ago
- Language-Agnostic SEntence Representations☆3,634Updated last year
- Tools to download and cleanup Common Crawl data☆1,002Updated 2 years ago
- Basic Utilities for PyTorch Natural Language Processing (NLP)☆2,219Updated last year
- Open-Source Neural Machine Translation in Tensorflow☆797Updated 2 years ago
- Fast Neural Machine Translation in C++☆1,321Updated last year
- Fast BPE☆670Updated 10 months ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,128Updated last month
- The Natural Language Decathlon: A Multitask Challenge for NLP☆2,349Updated this week
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…☆1,536Updated last year
- ☆3,645Updated 2 years ago
- ALBERT: A Lite BERT for Self-supervised Learning of Language Representations☆3,267Updated 2 years ago
- Translate - a PyTorch Language Library☆833Updated 2 years ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,895Updated 2 years ago
- A library for Multilingual Unsupervised or Supervised word Embeddings☆3,212Updated 2 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,187Updated last year
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,388Updated 3 years ago
- Speech Recognition using DeepSpeech2.☆2,115Updated 2 years ago