kbatsuren/wiktra

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kbatsuren/wiktra)

kbatsuren / wiktra

Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)

☆37

Alternatives and similar repositories for wiktra

Users that are interested in wiktra are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kbatsuren / CogNet
View on GitHub
CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates
☆56Jun 15, 2023Updated 3 years ago
virtualvinodh / aksharamukha-python
View on GitHub
Aksharamukha Python Library
☆62Feb 2, 2025Updated last year
IvanWang0730 / StyleAP
View on GitHub
Code and Data for Paper "Controlling Styles in Neural Machine Translation with Activation Prompt" (ACL 2023 Findings)
☆16Dec 20, 2022Updated 3 years ago
google / transliteration
View on GitHub
Transliteration data and models
☆56Nov 19, 2016Updated 9 years ago
ssmlkl / MnTTS2
View on GitHub
This is the experimental description of MnTTS2.
☆12Apr 11, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
DiLi-Lab / PoTeC
View on GitHub
This repository contains the Potsdam Textbook Corpus (PoTeC) which is a natural reading eye-tracking corpus.
☆16Jun 17, 2026Updated last month
amittai / cynical
View on GitHub
Cynical data selection
☆20Jan 16, 2021Updated 5 years ago
nipunsadvilkar / roberta-base-mr
View on GitHub
RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week
☆28Jul 18, 2021Updated 5 years ago
lecs-lab / polygloss
View on GitHub
A massively multilingual corpus and pretrained model for IGT
☆15Jun 4, 2026Updated last month
rnd2110 / MorphAGram
View on GitHub
A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars
☆17Jun 14, 2024Updated 2 years ago
wooorm / trigrams
View on GitHub
Trigram files for 500+ languages
☆24Mar 21, 2025Updated last year
pacscilab / voxangeles
View on GitHub
VoxAngeles Corpus
☆15Aug 23, 2025Updated 10 months ago
ffaisal93 / SD-QA
View on GitHub
☆16Feb 10, 2026Updated 5 months ago
discord / TextAttack
View on GitHub
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…
☆14Nov 23, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ehsanasgari / 1000Langs
View on GitHub
Creating super-parallel corpora of more than 1500+ unique languages for NLP research
☆33Dec 8, 2022Updated 3 years ago
sdtblck / Opensubtitles_dataset
View on GitHub
downloads and parses subtitle dataset from opensubtitles.org
☆15Apr 19, 2024Updated 2 years ago
Hamza5 / multilevel-diacritizer
View on GitHub
Extensible DL-based automatic Arabic diacritization tool allowing the restoration of different types of diacritics.
☆23Jul 25, 2023Updated 2 years ago
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago
keyreply / Thai-NLP-Dataset
View on GitHub
More than 43+ collections of Thai Natural Language Processing libraries. Update daily.
☆35Aug 22, 2018Updated 7 years ago
amasad / arabish
View on GitHub
Arabic Transliteration in Python
☆36Aug 19, 2013Updated 12 years ago
MarvinLvn / BabySLM
View on GitHub
Behavioral probing of language acquisition models at the lexical and syntactic level
☆20Jul 17, 2023Updated 3 years ago
cloudyr / aws.dynamodb
View on GitHub
Client Package for the Amazon DynamoDB Service
☆13Jan 12, 2020Updated 6 years ago
cindyxinyiwang / expand-via-lexicon-based-adaptation
View on GitHub
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆29Apr 2, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
steinst / SentAlign
View on GitHub
☆38Mar 16, 2026Updated 4 months ago
xiaody / adblock-minus
View on GitHub
Implements core functionality of adblocking
☆16Feb 23, 2018Updated 8 years ago
matbahasa / TALPCo
View on GitHub
TUFS Asian Language Parallel Corpus
☆53May 1, 2023Updated 3 years ago
huy-nguyen / remark-github-plugin
View on GitHub
Remark plugin to insert code from GitHub into markdown files
☆10Aug 1, 2021Updated 4 years ago
jsilve24 / kisses
View on GitHub
Keep It Simple Stupid Emacs Splash
☆12Sep 10, 2022Updated 3 years ago
aso2101 / prakrit_texts
View on GitHub
Digital texts in Prakrit
☆11Sep 14, 2025Updated 10 months ago
bltlab / paranames
View on GitHub
ParaNames: A multilingual resource for parallel names
☆40May 20, 2024Updated 2 years ago
Gigacore / ToneAIchemy
View on GitHub
In-browser textual tone analyzer using window.ai API
☆12Jul 22, 2024Updated last year
lincollincol / QRecorder
View on GitHub
QRecorder - playback capture api implementation with decoding recording from pcm to mp3 format
☆16Nov 4, 2020Updated 5 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
aldrichtr / tangld
View on GitHub
A literate dotfiles manager
☆11Jan 28, 2025Updated last year
brummer10 / aloop
View on GitHub
Audio File Looper for Linux
☆17Mar 31, 2025Updated last year
drobilla / zix
View on GitHub
A lightweight C library of portability wrappers and data structures
☆17Jul 3, 2026Updated 2 weeks ago
zwhe99 / LLM-MT-Eval
View on GitHub
{DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}
☆14Jun 18, 2023Updated 3 years ago
neubig / pialign
View on GitHub
pialign - A Phrasal ITG Aligner
☆24Apr 29, 2019Updated 7 years ago
vaishnavimurthy / Akaya-Telivigala
View on GitHub
Telugu + Latin
☆11Apr 29, 2021Updated 5 years ago
bestian / q-moedict
View on GitHub
備用的萌典(moedict pwa & app, Quasar used)
☆12Jul 31, 2025Updated 11 months ago