aryamanarora / schwa-deletion
Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi
☆17Updated last year
Alternatives and similar repositories for schwa-deletion:
Users that are interested in schwa-deletion are comparing it to the libraries listed below
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆72Updated last year
- SIGTYP 2022 Shared Task☆9Updated 2 years ago
- Python library for converting numbers to words for all Indian Languages.☆34Updated 2 weeks ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆231Updated 5 months ago
- ☆42Updated 2 years ago
- Python Finite-State Toolkit☆47Updated last week
- ☆14Updated 2 years ago
- A guide to building language technology in new languages.☆58Updated 2 years ago
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …☆15Updated 10 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆37Updated last year
- Read, write, and manipulate Praat TextGrid files with Python☆126Updated last year
- These are lists for a variety of languages containing words that are distinctive to each language.☆35Updated 2 years ago
- A repository for the 2022 Inflection Shared Task☆9Updated 2 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 4 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- ☆19Updated 3 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) script☆169Updated 5 months ago
- Finite-state script normalization and processing utilities☆38Updated this week
- ☆22Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆131Updated 9 months ago
- ☆42Updated 3 years ago
- Morfessor EM+Prune☆10Updated 4 years ago
- Linguistic processing for Common Voice☆52Updated last year
- SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages☆8Updated 11 months ago
- IPA tokeniser☆16Updated 9 months ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆39Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago