lingjzhu/clap-ipa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lingjzhu/clap-ipa)

lingjzhu / clap-ipa

Keyword spotting and forced alignment in any language

☆100

Alternatives and similar repositories for clap-ipa

Users that are interested in clap-ipa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lingjzhu / zipa
View on GitHub
A family of efficient speech models for multilingual phone recognition
☆68Jul 18, 2026Updated last week
changelinglab / PhoneticXeus
View on GitHub
A universal phone recognizer that can transcribe speech in 70+ languages into IPA
☆29Jun 9, 2026Updated last month
mrusci / ondevice-learning-kws
View on GitHub
Test Framework for few-shot open set KWS
☆45Nov 8, 2024Updated last year
lingjzhu / CharsiuG2P
View on GitHub
Multilingual G2P in 100 languages
☆391May 26, 2023Updated 3 years ago
lingjzhu / charsiu
View on GitHub
Charsiu: A neural phonetic aligner.
☆347Sep 19, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
xinjli / ucla-phonetic-corpus
View on GitHub
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
☆46May 12, 2023Updated 3 years ago
xinjli / phonepiece
View on GitHub
phone inventory library
☆17May 15, 2023Updated 3 years ago
HolgerBovbjerg / data2vec-KWS
View on GitHub
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆32Mar 6, 2025Updated last year
maxrmorrison / promonet
View on GitHub
Prosody and Pronunciation Modification Network
☆64May 5, 2025Updated last year
gusrud1103 / LibriPhrase
View on GitHub
Recipe for LibriPhrase
☆38Sep 2, 2023Updated 2 years ago
swagshaw / TorchKWS
View on GitHub
Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.
☆41Apr 5, 2024Updated 2 years ago
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
aizhiqi-work / MM-KWS
View on GitHub
Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"
☆51Jan 24, 2026Updated 6 months ago
iisys-hof / olaph
View on GitHub
OLaPh (Optimal Language Phonemizer) is a multilingual phonemization framework that converts text into phonemes surpassing the quality of …
☆17Jul 20, 2026Updated last week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ncsoft / PhonMatchNet
View on GitHub
Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)
☆63Jun 3, 2024Updated 2 years ago
pacscilab / voxangeles
View on GitHub
VoxAngeles Corpus
☆15Aug 23, 2025Updated 11 months ago
yfyeung / DS-WED
View on GitHub
[ICASSP 2026] Official code for "Measuring Prosody Diversity in Zero-Shot TTS: A New Metric, Benchmark, and Exploration"
☆17Apr 16, 2026Updated 3 months ago
dobby-seo / Wav2Keyword
View on GitHub
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
☆110Jan 11, 2023Updated 3 years ago
kaistmm / Metric-UD-KWS
View on GitHub
Official code for Metric learning for user-defined keyword spotting
☆40Feb 21, 2024Updated 2 years ago
NRC-ILT / g2p
View on GitHub
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
☆203Updated this week
kgnlp / allophant
View on GitHub
A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.
☆30Mar 14, 2025Updated last year
sarulab-speech / multi-speaker-dgp
View on GitHub
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Mar 23, 2021Updated 5 years ago
dianwen-ng / Keyword-Spotting-ConvMixer
View on GitHub
☆33Aug 10, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
jingyonghou / KWS_Max-pooling_RHE
View on GitHub
Mining effective negative training samples for keyword spotting (PyTorch)
☆66May 23, 2020Updated 6 years ago
Qualcomm-AI-research / bcresnet
View on GitHub
☆100May 31, 2023Updated 3 years ago
kjw11 / CSEnet-ASR
View on GitHub
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12Mar 14, 2025Updated last year
dmort27 / panphon
View on GitHub
Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.
☆320Oct 22, 2025Updated 9 months ago
xinjli / transphone
View on GitHub
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆174Jun 9, 2023Updated 3 years ago
pkufool / simple-wer
View on GitHub
A simple command line tool to calculate WER for ASR.
☆14Updated this week
luotianze666 / WaveFM
View on GitHub
[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
☆133Apr 8, 2026Updated 3 months ago
k2-fsa / libriheavy
View on GitHub
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
☆220Sep 10, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Interlagos / TENet-kws
View on GitHub
Tensorflow implementation of "Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution"(INTERSPEECH 2020)
☆32Nov 11, 2020Updated 5 years ago
harvard-edge / multilingual_kws
View on GitHub
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
☆190Dec 6, 2024Updated last year
yqcai888 / DCASE2023
View on GitHub
2022 DCASE Challenge
☆14Sep 30, 2024Updated last year
thelinhbkhn2014 / Text2PhonemeSequence
View on GitHub
☆53Aug 28, 2024Updated last year
X-LANCE / KWStreamingSearch
View on GitHub
☆94Jun 25, 2025Updated last year
haoheliu / SemantiCodec-inference
View on GitHub
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.
☆255Mar 7, 2025Updated last year
Wataru-Nakata / ssl-vocoders
View on GitHub
Implementation of vocoders empowered with pytorch lightning
☆18Jan 27, 2024Updated 2 years ago