neurlang/dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neurlang/dataset)

neurlang / dataset

IPA Phonetic dataset lexicon

☆18

Alternatives and similar repositories for dataset

Users that are interested in dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

neurlang / goruut
View on GitHub
IPA Phonemizer/Dephonemizer for 140 human languages
☆61Jun 20, 2026Updated last month
neurlang / gospeak
View on GitHub
A Golang Text to Speech System
☆17Feb 16, 2026Updated 5 months ago
iisys-hof / olaph
View on GitHub
OLaPh (Optimal Language Phonemizer) is a multilingual phonemization framework that converts text into phonemes surpassing the quality of …
☆17Updated this week
pengzhendong / ngram-punctuator
View on GitHub
An N-gram punctuator for Chinese and English.
☆18Oct 14, 2025Updated 9 months ago
codebyzeb / g2p-plus
View on GitHub
Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories
☆19Apr 10, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
changelinglab / PhoneticXeus
View on GitHub
A universal phone recognizer that can transcribe speech in 70+ languages into IPA
☆26Jun 9, 2026Updated last month
gladiaio / normalization
View on GitHub
A lightweight library for normalizing speech transcripts before computing WER
☆28Jul 14, 2026Updated last week
42io / tflite_kws
View on GitHub
☆13May 1, 2026Updated 2 months ago
Naozumi520 / g2pW-Cantonese
View on GitHub
Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW
☆15Dec 10, 2024Updated last year
xmos / fwk_voice
View on GitHub
Voice Framework
☆18Jan 21, 2026Updated 6 months ago
ysharma3501 / LayaCodec
View on GitHub
High fidelity neural audio codec for TTS models
☆36Dec 22, 2025Updated 7 months ago
sentencizer / sentencizer
View on GitHub
A sentence splitting (sentence boundary disambiguation) library for Go. It is rule-based and works out-of-the-box.
☆49Aug 31, 2025Updated 10 months ago
BUTSpeechFIT / cgmm_mvdr_online
View on GitHub
Implementation of CGMM-MVDR beamforming used for Clarity challenge
☆14Jan 14, 2022Updated 4 years ago
salesforce / speech-datasets
View on GitHub
Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…
☆15Jun 25, 2026Updated 3 weeks ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
jqueguiner / spleeter-as-a-service
View on GitHub
API implementation of Song Source spleeting from Spleeter by Deezer
☆13Mar 21, 2020Updated 6 years ago
juhayna-zh / BSRNN-speech-preprocess
View on GitHub
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15Aug 22, 2023Updated 2 years ago
R1ckShi / FrontEnd-AEC
View on GitHub
Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.
☆19Apr 22, 2019Updated 7 years ago
WhissleAI / PromptingNemo
View on GitHub
All-in-one Speech Transcription
☆11Jun 5, 2026Updated last month
pirxus / personalVAD
View on GitHub
An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.
☆90Sep 22, 2022Updated 3 years ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
Flux9665 / ArticulatoryTextFrontend
View on GitHub
This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…
☆14Sep 23, 2024Updated last year
Stylish-TTS / stylish-tts
View on GitHub
High quality text-to-speech based on StyleTTS 2.
☆78Apr 6, 2026Updated 3 months ago
bjnortier / whisper-tflite-ios
View on GitHub
☆19Nov 4, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ZQuang2202 / Zipformer_Lightning
View on GitHub
An upgrade framework for train and validate compare with icefall using Lightning.
☆16Mar 26, 2025Updated last year
xuancong84 / singapore-address-heatmap
View on GitHub
A database and crawling script for Singapore postal code, address name and geo-coordinates
☆14Jul 29, 2020Updated 5 years ago
pengzhendong / asr-decoder
View on GitHub
CTC decoder with hotwords for ASR.
☆38Jun 15, 2026Updated last month
msalhab96 / MultiSpeech
View on GitHub
pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper
☆21Jun 23, 2022Updated 4 years ago
ThomasHaubner / e2e_dnn_ad_control_for_lin_aec
View on GitHub
End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation
☆45Nov 17, 2023Updated 2 years ago
ChasTechProjects / Debian64Pi-old
View on GitHub
64-bit Debian Stretch for the Raspberry Pi 3
☆12Dec 27, 2018Updated 7 years ago
jingyonghou / KWS_Max-pooling_RHE
View on GitHub
Mining effective negative training samples for keyword spotting (PyTorch)
☆66May 23, 2020Updated 6 years ago
wangchengzhong / GRE-Net
View on GitHub
Official Repository for "Global Rotation Equivariant Phase Modeling for Speech Enhancement with Deep Magnitude-Phase Interaction"
☆19Jun 25, 2026Updated 3 weeks ago
lingjzhu / zipa
View on GitHub
A family of efficient speech models for multilingual phone recognition
☆68Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TeaPoly / speexdsp-ns-python
View on GitHub
Python bindings of speexdsp noise suppression library
☆49Nov 18, 2022Updated 3 years ago
JonathanDZ / TF-FaSNet
View on GitHub
☆24Feb 28, 2023Updated 3 years ago
morrisalp / taatiknet
View on GitHub
Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.
☆16Jun 27, 2023Updated 3 years ago
ankurdhuriya / multispeaker-glow-tts
View on GitHub
☆11Jan 28, 2022Updated 4 years ago
gheyret / UQSpeechDataset
View on GitHub
Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット
☆35Apr 3, 2022Updated 4 years ago
ftshijt / speech_evaluation
View on GitHub
A toolkit dedicate for speech evaluation.
☆23Sep 26, 2024Updated last year
evanshortiss / yr.no-interface
View on GitHub
Wrapper for the yr.no weather service API.
☆15Apr 12, 2018Updated 8 years ago