vincentzlt / textprep
Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Translation tasks. It is designed especially for logographic languages such as Chinese and Japanese.
☆32Updated 5 years ago
Alternatives and similar repositories for textprep:
Users that are interested in textprep are comparing it to the libraries listed below
- Very Deep Pairwise Word Interaction Neural Networks for modeling textual similarity (He and Lin, NAACL/HLT 2016)☆19Updated 6 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University☆35Updated 5 years ago
- ☆66Updated 2 years ago
- An attentional NMT model in Dynet☆26Updated 6 years ago
- A latent-variable model for learning bilingual word embedding mappings☆18Updated 5 years ago
- ☆45Updated 7 years ago
- Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"☆43Updated 3 years ago
- Cross-lingual Dependency Parsing Based on Distributed Representations☆20Updated 6 years ago
- Code for NAACL19 Paper "How Large a Vocabulary Does Text Classification Need? A Variational Approach to Vocabulary Selection"☆42Updated 5 years ago
- ☆48Updated 7 years ago
- A sentence encoding-based model for natural language inference☆31Updated 6 years ago
- Implementation of "Controlling Output Length in Neural Encoder-Decoders"☆42Updated 6 years ago
- Code for upcoming TACL paper w/ Graham Neubig, "Neural Lattice Language Models".☆48Updated 7 years ago
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 6 years ago
- ☆26Updated 8 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆77Updated 2 years ago
- Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings☆20Updated 4 years ago
- ☆43Updated 6 years ago
- MT/IE: Cross-lingual Open Information Extraction with Neural Sequence-to-Sequence Models☆23Updated 6 years ago
- ACL2015_code_Gated Recursive Neural Network for Chinese Word Segmentation☆28Updated 9 years ago
- Author implementation of "Learning Recurrent Span Representations for Extractive Question Answering" (Lee et al. 2016)☆33Updated 7 years ago
- LSTM Language Model with Subword Units Input Representations☆43Updated 3 years ago
- ☆18Updated 7 years ago
- ☆20Updated 6 years ago
- Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages"☆40Updated 6 years ago
- Multilingual hierarchical attention networks toolkit☆78Updated 5 years ago
- Pre-training character n-gram embeddings☆23Updated last year
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Updated 7 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆64Updated 2 years ago