vincentzlt / textprepLinks
Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Translation tasks. It is designed especially for logographic languages such as Chinese and Japanese.
☆32Updated 6 years ago
Alternatives and similar repositories for textprep
Users that are interested in textprep are comparing it to the libraries listed below
Sorting:
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 6 years ago
- An attentional NMT model in Dynet☆26Updated 6 years ago
- ☆28Updated 8 years ago
- Cross-lingual Dependency Parsing Based on Distributed Representations☆20Updated 7 years ago
- Implementation of "Controlling Output Length in Neural Encoder-Decoders"☆42Updated 7 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University☆34Updated 6 years ago
- Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"☆42Updated 3 years ago
- ☆42Updated 6 years ago
- Code for NAACL19 Paper "How Large a Vocabulary Does Text Classification Need? A Variational Approach to Vocabulary Selection"☆42Updated 5 years ago
- ☆44Updated 7 years ago
- ☆66Updated 2 years ago
- Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages"☆39Updated 6 years ago
- An example of DyNet autobatching for the NIPS "how to code a paper" workshop☆12Updated 7 years ago
- ☆47Updated 8 years ago
- This repo is for residual-connected sentence encoder for NLI.☆11Updated 7 years ago
- Code for upcoming TACL paper w/ Graham Neubig, "Neural Lattice Language Models".☆48Updated 7 years ago
- Attempt at using LSTMs to predict semantic relatedness of sentences (a la Tai et al. in Improved Semantic Representations From Tree-Struc…☆22Updated 9 years ago
- Language modeling scripts based on TensorFlow☆58Updated 5 years ago
- A latent-variable model for learning bilingual word embedding mappings☆18Updated 6 years ago
- MT/IE: Cross-lingual Open Information Extraction with Neural Sequence-to-Sequence Models☆23Updated 6 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.☆33Updated 6 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆78Updated 2 years ago
- ☆25Updated 9 years ago
- Symphony Machine Translation☆38Updated 5 years ago
- Very Deep Pairwise Word Interaction Neural Networks for modeling textual similarity (He and Lin, NAACL/HLT 2016)☆18Updated 7 years ago
- Dataset for CIKM 2018 paper "Question Headline Generation for News Articles"☆9Updated 6 years ago
- A sentence encoding-based model for natural language inference☆31Updated 7 years ago
- An implementation of the Globally Normalized Reader☆58Updated 2 years ago
- Datasets for Question Answering by Search and Reading☆69Updated 7 years ago
- Cynical data selection☆20Updated 4 years ago