Bi-directional transliterator for Python. Transliterates Unicode strings according to the rules specified in the language packs.
☆310 · Aug 29, 2023 · Updated 2 years ago
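The idea of rule-based, bi-directional transliteration can be illustrated with a minimal sketch. It does not use the library's API: the `MAPPING` table is a hypothetical, heavily truncated stand-in for a language pack, and the function names are illustrative assumptions only.

```python
# Hypothetical, truncated "language pack": a few Cyrillic -> Latin rules.
# A real pack would cover the whole alphabet and multi-character rules.
MAPPING = {"п": "p", "р": "r", "и": "i", "в": "v", "е": "e", "т": "t"}

# The reverse direction is derived by inverting the forward table.
REVERSE = {latin: cyrillic for cyrillic, latin in MAPPING.items()}


def apply_rules(text, table):
    """Replace each character that has a rule; pass others through unchanged."""
    return "".join(table.get(ch, ch) for ch in text)


def to_latin(text):
    """Transliterate Cyrillic characters to Latin using the forward table."""
    return apply_rules(text, MAPPING)


def to_cyrillic(text):
    """Transliterate Latin characters back to Cyrillic using the inverted table."""
    return apply_rules(text, REVERSE)


if __name__ == "__main__":
    print(to_latin("привет"))
    print(to_cyrillic("privet"))
```

Inverting a one-to-one table is the simplest way to get both directions from a single rule set; it breaks down as soon as one source character maps to several target characters, which a real language pack has to handle explicitly.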
Alternatives and similar repositories for transliterate
Users who are interested in transliterate compare it to the libraries listed below.
- Wiktra - Python tool of Wiktionary transliteration modules for 514 languages and their 102 different scripts (orthographies) ☆34 · Jun 29, 2025 · Updated 8 months ago
- Morphological analyzer / inflection engine for Russian and Ukrainian languages. ☆1,166 · Jun 26, 2024 · Updated last year
- Transliterate Cyrillic → Latin in every possible way ☆126 · Feb 4, 2026 · Updated last month
- Cynical data selection ☆20 · Jan 16, 2021 · Updated 5 years ago
- Solves basic Russian NLP tasks, API for lower level Natasha projects ☆1,315 · Oct 17, 2024 · Updated last year
- ☆24 · Apr 15, 2021 · Updated 4 years ago
- Tools for handling GRNTI list ☆10 · Sep 2, 2023 · Updated 2 years ago
- auth and permissions for aiohttp ☆238 · Updated this week
- A set of guides used by the BestDoctor development team ☆260 · Mar 22, 2024 · Updated last year
- The grapheme-to-phoneme model converts Kazakh (Arabic|Cyrillic) characters to phonemes. ☆12 · Sep 30, 2019 · Updated 6 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 · May 4, 2022 · Updated 3 years ago
- ☆12 · Apr 7, 2015 · Updated 10 years ago
- ☆12 · Apr 2, 2024 · Updated last year
- The Tweets2013 Internet Archive collection ☆10 · Aug 7, 2020 · Updated 5 years ago
- Promoss Topic Modelling Toolbox ☆11 · Jan 21, 2019 · Updated 7 years ago
- Simple app to send server pushed messages from Python using NodeJS - SocketIO - Redis - Nginx ☆23 · Mar 2, 2014 · Updated 12 years ago
- Russian/English/Estonian/Finnish/Swedish phonetic algorithm based on Soundex and Metaphone ☆52 · Mar 1, 2025 · Updated last year
- Study on lexibank data (presenting the lexibank dataset). ☆15 · Apr 11, 2025 · Updated 10 months ago
- Course in Natural Language Processing and Applications ☆10 · Oct 4, 2022 · Updated 3 years ago
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation. ☆11 · Oct 15, 2018 · Updated 7 years ago
- Copy structure of your Postgres DBs as Markdown to prompt LLMs better! ☆14 · Feb 23, 2025 · Updated last year
- Expletives vomiting library... ☆13 · Apr 17, 2017 · Updated 8 years ago
- Topic supervised non-negative matrix factorization with sparse matrices ☆12 · Mar 24, 2020 · Updated 5 years ago
- A massively multilingual corpus and pretrained model for IGT ☆14 · Feb 21, 2026 · Updated 2 weeks ago
- Helpers for atomic file writes ☆11 · Jul 4, 2014 · Updated 11 years ago
- command line resource for working with digital primary sources ☆28 · Aug 3, 2018 · Updated 7 years ago
- Multilingual text (NLP) processing toolkit ☆2,366 · Nov 10, 2023 · Updated 2 years ago
- Simple threaded cassandra wrapper for asyncio ☆85 · Dec 15, 2020 · Updated 5 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications ☆12 · Oct 27, 2021 · Updated 4 years ago
- WordWanderer – take your text for a walk ☆12 · May 14, 2019 · Updated 6 years ago
- Chromium- and Firefox-compatible extension to add downloads to ocDownloader directly from your browser ☆14 · May 17, 2019 · Updated 6 years ago
- A downloader and reader for the OSM planet files. ☆16 · Dec 18, 2021 · Updated 4 years ago
- Speech Processing & Linguistic Analysis Tool ☆11 · Jun 30, 2019 · Updated 6 years ago
- Sequence algorithms for use in Flashlight. ☆14 · Jan 12, 2026 · Updated last month
- TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions ☆12 · May 18, 2017 · Updated 8 years ago
- ☆13 · Dec 12, 2016 · Updated 9 years ago
- Villages, names and places ☆10 · Jan 3, 2020 · Updated 6 years ago
- A framework for Oxygen XML Editor allowing researchers to transcribe historical documents in TEI ☆21 · Jun 24, 2024 · Updated last year
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆240 · Jul 26, 2024 · Updated last year