google / transliteration
Transliteration data and models
☆56 Updated 8 years ago
Alternatives and similar repositories for transliteration
Users who are interested in transliteration are comparing it to the libraries listed below.
- Crawler for linguistic corpora ☆208 Updated last month
- Transliteration module for Indian Languages ☆79 Updated last year
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆197 Updated 5 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kannada, Malayalam, Urdu, Bengali) ☆42 Updated 2 years ago
- Resources to go with the Indic NLP Library ☆76 Updated 3 years ago
- Automatic transliteration with LSTM ☆92 Updated 6 years ago
- Text and Punctuation correction with Deep Learning ☆128 Updated 5 years ago
- SymSpellCompound: compound aware automatic spelling correction ☆65 Updated 7 years ago
- Corpus preprocessing ☆99 Updated last year
- Thot toolkit for statistical machine translation ☆53 Updated 2 years ago
- Translation demonstrator ☆34 Updated 5 years ago
- Sentence aligner ☆117 Updated 4 years ago
- Deep-learning based sentence auto-segmentation from unstructured text w/o punctuation ☆36 Updated 8 years ago
- MIT Language Modeling Toolkit ☆117 Updated 5 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies) ☆31 Updated 3 months ago
- General-Purpose Neural Networks for Sentence Boundary Detection ☆73 Updated 2 years ago
- Bitextor generates translation memories from multilingual websites ☆296 Updated 11 months ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard ☆50 Updated 3 years ago
- Fast approximate strings search & spelling correction ☆59 Updated 3 years ago
- Cython wrapper on Hunspell Dictionary ☆66 Updated last year
- Collaborative on-line editor for aligned parallel texts. ☆13 Updated 3 years ago
- Language Tool style grammar handling with spaCy 2.0 ☆42 Updated 7 years ago
- NLTK Contrib ☆166 Updated last year
- A PyTorch implementation of auto-punctuation learned character by character ☆141 Updated 4 years ago
- Neural Adaptive Machine Translation that adapts to context and learns from corrections. ☆348 Updated 3 years ago
- Transliterating English to Hindi using Recurrent Neural Networks ☆45 Updated 8 years ago
- ☆69 Updated 2 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆69 Updated 3 months ago
- An English to Hindi Dictionary ☆28 Updated 5 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more. ☆98 Updated 5 years ago