google / transliteration
Transliteration data and models
☆53 Updated 7 years ago
Related projects
Alternatives and complementary repositories for transliteration
- Thot toolkit for statistical machine translation ☆50 Updated last year
- Transliteration module for Indian Languages ☆77 Updated 11 months ago
- Automatic transliteration with LSTM ☆92 Updated 5 years ago
- Resources to go with the Indic NLP Library ☆71 Updated 2 years ago
- Xlit-Crowd: Hindi-English Transliteration Corpus ☆37 Updated 9 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆80 Updated 6 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia ☆72 Updated 9 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆185 Updated 4 years ago
- Sentence aligner ☆108 Updated 3 years ago
- Fast approximate strings search & spelling correction ☆57 Updated 3 years ago
- NLTK Contrib ☆166 Updated 8 months ago
- Program used to split text into segments ☆25 Updated 2 weeks ago
- ☆14 Updated last year
- Corpus preprocessing ☆95 Updated 7 months ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard ☆43 Updated 2 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kannada, Malayalam, Urdu, Bengali) ☆40 Updated last year
- Collaborative on-line editor for aligned parallel texts. ☆12 Updated 2 years ago
- Wiktionary parser tool for many language editions. ☆53 Updated 2 years ago
- MIT Language Modeling Toolkit ☆115 Updated 4 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection ☆73 Updated last year
- Fast supervised sentence boundary detection using the averaged perceptron ☆90 Updated 5 years ago
- Automatic extraction of edited sentences from text edition histories. ☆81 Updated 2 years ago
- Dockerized NMT frameworks for nmt-wizard ☆39 Updated last year
- Data collection, alignment and TAUS repository ☆20 Updated 6 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!". ☆63 Updated 9 years ago
- Efficient Markov Chain word alignment ☆54 Updated 3 years ago
- PurePos is an open source hybrid morphological tagger. ☆15 Updated 4 years ago
- Lexical data at Unicode ☆65 Updated 2 months ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities ☆112 Updated 2 years ago