google / transliterationLinks
Transliteration data and models
☆56Updated 9 years ago
Alternatives and similar repositories for transliteration
Users that are interested in transliteration are comparing it to the libraries listed below
Sorting:
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆200Updated 5 years ago
- Transliteration module for Indian Languages☆79Updated 2 months ago
- Crawler for linguistic corpora☆213Updated 5 months ago
- Automatic transliteration with LSTM☆92Updated 7 years ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard☆49Updated 3 years ago
- Text and Punctuation correction with Deep Learning☆128Updated 5 years ago
- Thot toolkit for statistical machine translation☆53Updated 3 years ago
- NLTK Contrib☆169Updated last year
- Inforex is a web system for text corpora construction.☆12Updated 6 months ago
- Resources to go with the Indic NLP Library☆77Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆32Updated 6 months ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface)☆34Updated 3 years ago
- Morphological Dictionaries for German Language☆30Updated 7 years ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Dockerized NMT frameworks for nmt-wizard☆39Updated 2 years ago
- Corpus preprocessing☆99Updated last year
- Translation demonstrator☆37Updated 5 years ago
- A fast, simple, multilingual tokenizer☆29Updated 8 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆42Updated 2 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features☆41Updated 6 years ago
- Deep-learning based sentence auto-segmentation from unstructured text w/o punctuation☆36Updated 8 years ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆88Updated 2 months ago
- ☆12Updated 10 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70Updated last month
- German Morphological Analyzer☆51Updated 4 years ago
- MIT Language Modeling Toolkit☆119Updated 6 years ago
- Fast supervised sentence boundary detection using the averaged perceptron☆91Updated 7 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 7 years ago