kpu / kenlm
KenLM: Faster and Smaller Language Model Queries
☆2,544 Updated 5 months ago
Alternatives and similar repositories for kenlm:
Users that are interested in kenlm are comparing it to the libraries listed below
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation ☆2,213 Updated 5 months ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) ☆1,192 Updated 3 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,898 Updated last year
- MASS: Masked Sequence to Sequence Pre-training for Language Generation ☆1,120 Updated 2 years ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character … ☆1,892 Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,345 Updated 9 months ago
- Neural machine translation and sequence learning using TensorFlow ☆1,458 Updated last year
- A python tool for evaluating the quality of sentence embeddings. ☆2,091 Updated 9 months ago
- A tool for extracting plain text from Wikipedia dumps ☆3,783 Updated 7 months ago
- Pre-trained ELMo Representations for Many Languages ☆1,461 Updated 3 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition. ☆1,862 Updated 2 years ago
- General purpose unsupervised sentence representations ☆1,198 Updated 2 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models ☆1,620 Updated last year
- Open-Source Neural Machine Translation in Tensorflow ☆797 Updated 2 years ago
- PyTorch CTC Decoder bindings ☆831 Updated 9 months ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ☆6,180 Updated last year
- Multi-Task Deep Neural Networks for Natural Language Understanding ☆2,244 Updated 10 months ago
- Simple, fast unsupervised word aligner ☆742 Updated 2 years ago
- InferSent sentence embeddings ☆2,285 Updated 3 years ago
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul… ☆1,532 Updated last year
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,… ☆2,374 Updated 2 years ago
- NLP made easy ☆2,552 Updated last year
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards. ☆1,245 Updated 2 years ago
- Basic Utilities for PyTorch Natural Language Processing (NLP) ☆2,212 Updated last year
- A library for Multilingual Unsupervised or Supervised word Embeddings ☆3,199 Updated 2 years ago
- An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group ☆706 Updated 2 years ago
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities" ☆1,413 Updated last year
- Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks" ☆2,180 Updated 2 years ago
- BERT NLP papers, applications, and GitHub resources, including the newest XLNet; BERT- and XLNet-related papers and GitHub projects ☆1,849 Updated 3 years ago
- ☆1,259 Updated 2 years ago