Finite-state script normalization and processing utilities
☆52Jun 24, 2026Updated last week
Alternatives and similar repositories for nisaba
Users that are interested in nisaba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Read-only unofficial mirror of OpenFst☆44May 15, 2022Updated 4 years ago
- Read-only unofficial mirror of Pynini☆17May 7, 2019Updated 7 years ago
- Read-only unofficial mirror of the OpenGrm Thrax Grammar Development Tools☆16May 2, 2019Updated 7 years ago
- Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository☆27Jul 26, 2020Updated 5 years ago
- Collection of auditory models.☆34Feb 4, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 5 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages☆17Apr 14, 2026Updated 2 months ago
- A python library for easily querying morphological inflection models trained on Unimorph☆13Oct 23, 2022Updated 3 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Feb 6, 2024Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆18Aug 4, 2025Updated 11 months ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 8 months ago
- [NeurIPS 2024] 🕸 GlotCC Dataset and Pipline☆20Apr 6, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Random notes on Python internationalisation☆19Aug 10, 2023Updated 2 years ago
- unicodedata backport/updates☆39Mar 5, 2026Updated 3 months ago
- Massively multilingual pronunciation mining☆368May 23, 2026Updated last month
- [LREC 2024] 🖋 Resource and Tool for Writing System Identification☆22Mar 29, 2026Updated 3 months ago
- Mirror of GlottHMM☆10Jun 7, 2016Updated 10 years ago
- ☆17Jul 29, 2018Updated 7 years ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆22Feb 14, 2024Updated 2 years ago
- Forced alignment decoder for Whisper.☆16Mar 13, 2024Updated 2 years ago
- A human-annotated morphosyntactic treebank for Turkish.☆34Nov 18, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The website of the Oscar Project