aryamanarora/schwa-deletion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aryamanarora/schwa-deletion)

aryamanarora / schwa-deletion

Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi

☆16

Alternatives and similar repositories for schwa-deletion

Users that are interested in schwa-deletion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

farmersrice / saltzero
View on GitHub
Machine learning bot for ultimate tic-tac-toe based on DeepMind's AlphaGo Zero paper. C++ and Python.
☆27Jan 2, 2026Updated 6 months ago
rossellhayes / ipa
View on GitHub
🗣️ Convert between phonetic alphabets
☆11Feb 7, 2022Updated 4 years ago
cldf-clts / clts
View on GitHub
Cross-Linguistic Transcription Systems
☆17Mar 20, 2026Updated 3 months ago
lingpy / lingrex
View on GitHub
Linguistic Reconstruction with LingPy
☆16Aug 5, 2024Updated last year
RamchandraApte / OmniTemplate
View on GitHub
All the code for all the algorithms and data structures and utilities for competitive programming.
☆15Dec 21, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
timarkh / uniparser-grammar-udm
View on GitHub
Morphological analysis for Udmurt.
☆12May 23, 2026Updated last month
bert-nmt / ctx-bert-nmt
View on GitHub
Extend bert-nmt to context-aware translation.
☆11May 24, 2021Updated 5 years ago
ehsanasgari / 1000Langs
View on GitHub
Creating super-parallel corpora of more than 1500+ unique languages for NLP research
☆33Dec 8, 2022Updated 3 years ago
tshrjn / qgen
View on GitHub
Question generation from Reading Comprehension
☆19Feb 28, 2022Updated 4 years ago
skit-ai / Map-Mix
View on GitHub
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…
☆18Feb 17, 2023Updated 3 years ago
shijie-wu / neural-transducer
View on GitHub
This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.
☆77Sep 13, 2023Updated 2 years ago
rhuebler / HOPS
View on GitHub
☆17Nov 19, 2021Updated 4 years ago
nytud / HuLU
View on GitHub
Hungarian Language Understanding Benchmark Kit. Includes evaluation scripts and a Python package for benchmarking models on HuLU.
☆15Oct 29, 2025Updated 8 months ago
tlringer / proof-chat-fun
View on GitHub
playing with gpt4
☆13Mar 17, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
loanwordbank / loanpy
View on GitHub
LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …
☆16Jun 10, 2026Updated last month
juditacs / wikt2dict
View on GitHub
Wiktionary parser tool for many language editions.
☆54Aug 17, 2022Updated 3 years ago
aso2101 / prakrit_texts
View on GitHub
Digital texts in Prakrit
☆11Sep 14, 2025Updated 10 months ago
ayh2bxa / realtime_nkf_aec
View on GitHub
☆18Dec 27, 2023Updated 2 years ago
Joon-Park92 / Zero-Shot-Translation-Transformer
View on GitHub
Zero-Shot Translation implemented by Transformer
☆14Mar 24, 2023Updated 3 years ago
chorusai / arpa2ipa
View on GitHub
A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)
☆17Jan 2, 2018Updated 8 years ago
ftruzzi / ensure_vpn
View on GitHub
Make sure you're connected to your favorite VPN before running your Python script.
☆18Mar 25, 2021Updated 5 years ago
alanackart / English-phonetic-transcription
View on GitHub
The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…
☆30Apr 25, 2017Updated 9 years ago
ari-holtzman / newformer
View on GitHub
☆16Jul 20, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mrinaldhar / en-hi-codemixed-corpus
View on GitHub
Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus
☆13Feb 17, 2019Updated 7 years ago
jmccrae / yuzu
View on GitHub
Micro-framework for publishing linked data
☆11Aug 1, 2017Updated 8 years ago
LCS2-IIITD / DaSLaM
View on GitHub
☆17Oct 31, 2023Updated 2 years ago
antimatter15 / js-wikireader
View on GitHub
An Offline Wikipedia Dump Reader in Javascript that probably only works on Chrome
☆19Dec 23, 2011Updated 14 years ago
jessicarick / resources
View on GitHub
Tutorials, templates, etc. for other students
☆18Nov 5, 2025Updated 8 months ago
bhaddow / pmindia-crawler
View on GitHub
Code for extracting parallel corpora from pmindia
☆17Jan 28, 2020Updated 6 years ago
iisys-hof / olaph
View on GitHub
OLaPh (Optimal Language Phonemizer) is a multilingual phonemization framework that converts text into phonemes surpassing the quality of …
☆17Jun 8, 2026Updated last month
cldf / pycldf
View on GitHub
python package to read and write CLDF datasets
☆21Updated this week
sanskrit-lexicon / COLOGNE
View on GitHub
Development of http://www.sanskrit-lexicon.uni-koeln.de/
☆19Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
DataSenseiAryan / GoogleSpeechCommandLowFootprint
View on GitHub
This repository contains the Code for SOTA model on Google Speech Command V2 dataset.
☆16Sep 28, 2023Updated 2 years ago
eliyahabba / PromptSuite
View on GitHub
☆16Nov 24, 2025Updated 7 months ago
SapienzaNLP / conception
View on GitHub
Code and experiments for the COLING2020 paper "Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations".
☆11Dec 9, 2020Updated 5 years ago
leimao / Simple-Inference-Server
View on GitHub
Inference Server Implementation from Scratch for Machine Learning Models
☆24Dec 31, 2020Updated 5 years ago
lab260ru / balalaika
View on GitHub
[INTERSPEECH 2026] Official code for "Balalaika: Data-Centric, Prosody-Aware Annotation Pipeline for Russian Speech"
☆21Jul 11, 2026Updated last week
QuwsarOhi / BanglaWriting
View on GitHub
BanglaWriting: A multi-purpose offline Bangla handwriting dataset
☆14Nov 18, 2020Updated 5 years ago
Riccorl / sense-embedding
View on GitHub
BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText
☆10Sep 3, 2019Updated 6 years ago