YerevaNN / translit-rnn
Automatic transliteration with LSTM
☆92 Updated 5 years ago
Related projects:
- ☆34 Updated 7 years ago
- A sentence aligner for comparable corpora ☆127 Updated 8 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆180 Updated 3 years ago
- Easy to use scripts for evaluating word vectors on a variety of tasks. ☆120 Updated 3 years ago
- Large corpus of uncompressed and compressed sentences from news articles. ☆122 Updated 7 years ago
- Train bilingual embeddings as described in our NAACL 2015 workshop paper "Bilingual Word Representations with Monolingual Quality in Mind… ☆76 Updated 5 years ago
- Appraise evaluation system for manual evaluation of machine translation output ☆73 Updated 3 years ago
- ☆68 Updated last year
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings) ☆124 Updated last year
- Language Identification and transliteration tool for Indian language code mixed data. ☆23 Updated 8 years ago
- ☆56 Updated 6 years ago
- A Multilingual and Multilevel Representation Learning Toolkit for NLP ☆117 Updated 6 years ago
- Automatic extraction of edited sentences from text edition histories. ☆80 Updated 2 years ago
- Fast Word Clustering Software ☆74 Updated last month
- Code to train and use models from "Charagram: Embedding Words and Sentences via Character n-grams". ☆125 Updated 8 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia ☆72 Updated 9 years ago
- Python library for converting UTF to WX and vice-versa for Indian languages. ☆48 Updated 2 years ago
- Fast supervised sentence boundary detection using the averaged perceptron ☆90 Updated 5 years ago
- State-of-the-art Supervised Sentence Simplification System from ACL 2014 ☆47 Updated 5 years ago
- ☆23 Updated 6 years ago
- A series of scripts to download and parse the OpenSubtitles corpus. ☆86 Updated 8 years ago
- ☆29 Updated 6 years ago
- Unsupervised Statistical Machine Translation ☆227 Updated 4 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora. ☆33 Updated 6 years ago
- ☆23 Updated 7 years ago
- TheanoLM is a recurrent neural network language modeling tool implemented using Theano ☆81 Updated 3 months ago
- Making sense embedding out of word embeddings using graph-based word sense induction ☆212 Updated 3 years ago
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018) ☆158 Updated 5 years ago
- Utility scripts in Python ☆37 Updated 3 weeks ago
- ☆55 Updated 9 years ago