vincentzlt / textprep
Textprep is an analysis tool for both parallel and non-parallel corpora and their downstream Natural Language Processing and Machine Translation tasks. It is designed especially for logographic languages such as Chinese and Japanese.
☆32Updated 6 years ago
Alternatives and similar repositories for textprep
Users that are interested in textprep are comparing it to the libraries listed below
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 6 years ago
- ☆66Updated 2 years ago
- Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"☆42Updated 3 years ago
- An implementation of the Globally Normalized Reader☆58Updated 2 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University☆34Updated 6 years ago
- ☆44Updated 7 years ago
- ☆47Updated 8 years ago
- Implementation of "Controlling Output Length in Neural Encoder-Decoders"☆42Updated 7 years ago
- Multilingual hierarchical attention networks toolkit☆77Updated 5 years ago
- ☆42Updated 7 years ago
- Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages"☆39Updated 7 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆78Updated 2 years ago
- Language modeling scripts based on TensorFlow☆58Updated 6 years ago
- Easy-first dependency parser based on Hierarchical Tree LSTMs☆32Updated 8 years ago
- Modularizing Unsupervised Sense Embedding☆29Updated 7 years ago
- An attentional NMT model in Dynet☆26Updated 6 years ago
- Decomposable Attention Model for Sentence Pair Classification (from https://arxiv.org/abs/1606.01933)☆95Updated 8 years ago
- Datasets for Question Answering by Search and Reading☆70Updated 7 years ago
- PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"☆76Updated 8 years ago
- Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings☆20Updated 5 years ago
- ☆28Updated 9 years ago
- BiLSTM-CRF for sequence labeling in Dynet☆81Updated 8 years ago
- Automatically exported from code.google.com/p/jacana☆37Updated 10 years ago
- A sentence encoding-based model for natural language inference☆31Updated 7 years ago
- Utility class for building/evaluating document representations☆53Updated 5 years ago
- Attempt at using LSTMs to predict semantic relatedness of sentences (a la Tai et al. in Improved Semantic Representations From Tree-Struc…☆22Updated 9 years ago
- Code for NAACL19 Paper "How Large a Vocabulary Does Text Classification Need? A Variational Approach to Vocabulary Selection"☆42Updated 6 years ago
- Answer Sentence Selection using Deep Learning☆63Updated 9 years ago
- MT/IE: Cross-lingual Open Information Extraction with Neural Sequence-to-Sequence Models☆23Updated 7 years ago
- Decoding platform for machine translation research☆55Updated 6 years ago