levyfan / sentencepiece-jni
Java JNI wrapper for SentencePiece: unsupervised text tokenizer for Neural Network-based text generation.
☆37 Updated 2 years ago
Alternatives and similar repositories for sentencepiece-jni
Users who are interested in sentencepiece-jni are comparing it to the libraries listed below.
- Decoding platform for machine translation research ☆55 Updated 5 years ago
- Symphony Machine Translation ☆38 Updated 5 years ago
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs ☆107 Updated 2 years ago
- Subword Language Model for Query Auto-Completion ☆67 Updated 5 years ago
- Dialog State Tracking Challenge 6 (DSTC6) ☆54 Updated 7 years ago
- Corpus preprocessing ☆97 Updated last year
- Symmetric Delete spelling correction algorithm using Java ☆14 Updated 10 months ago
- A collection of resources on using BERT (https://arxiv.org/abs/1810.04805) and related Language Models in production environments. ☆96 Updated 4 years ago
- Embedding Quantization (Compress Word Embeddings) ☆86 Updated 5 years ago
- Knowledge Distillation For Transformer Language Models ☆52 Updated last year
- Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018 ☆124 Updated 5 years ago
- Multiple Different Natural Language Processing Tasks in a Single Deep Model ☆48 Updated 6 years ago
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018) ☆157 Updated 6 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) ☆152 Updated 4 years ago
- Data and code for the paper "End-to-End Slot Alignment and Recognition for Cross-Lingual NLU" (Accepted to EMNLP 2020) ☆25 Updated 3 years ago
- An extension of word2vec to learn phrase embeddings ☆75 Updated 6 years ago
- ☆42 Updated 7 years ago
- Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI) ☆200 Updated 2 years ago
- NoiseMix - data generation for natural language ☆40 Updated 7 years ago
- Triangular-chain CRF ☆25 Updated 9 years ago
- This repo includes extensions to the Stanford Dialogue Corpus. It contains crowd-sourced rewrites to facilitate research in dialogue stat… ☆90 Updated 6 years ago
- PyTorch implementation of StarSpace as described in "StarSpace: Embed All The Things!" by Ledell Wu, Adam Fisch, Sumit Chopra, Keith Adam… ☆50 Updated 7 years ago
- LM Pretraining with PyTorch/TPU ☆134 Updated 5 years ago
- Personalized Query Completion ☆27 Updated 4 years ago
- Keras implementation of CoVe ☆50 Updated 6 years ago
- Code for reproducing the results from the paper Few Shot Text Classification with a Human in the Loop ☆90 Updated 7 years ago
- Phrase-Indexed Question Answering (PIQA) ☆94 Updated 6 years ago
- Source code for the paper "Morphological Inflection Generation with Hard Monotonic Attention" ☆37 Updated 7 years ago
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training. ☆164 Updated 4 years ago
- Implementation of pQRNN in PyTorch ☆46 Updated 3 years ago