ronggong / phoneticSimilarity
phonetic similarity algorithms
☆12 Updated 6 years ago
Alternatives and similar repositories for phoneticSimilarity:
Users who are interested in phoneticSimilarity are comparing it to the libraries listed below.
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 Updated 3 years ago
- ☆22 Updated 3 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa… ☆17 Updated 9 years ago
- A neural network, written in TensorFlow, for filtering a target speaker's voice from audio ☆21 Updated 6 years ago
- A packaged convolutional voice activity detector for noisy environments. ☆14 Updated 5 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated last year
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 Updated 5 years ago
- Voice Conversion using Tacotron. ☆11 Updated 2 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples ☆23 Updated 4 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆28 Updated 8 months ago
- ☆16 Updated 5 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 Updated 4 years ago
- ☆56 Updated 2 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer ☆19 Updated 4 years ago
- Wake word spotting with Kaldi ☆19 Updated 4 years ago
- A handy dataset of noises for ASR ☆19 Updated 5 years ago
- Grapheme to phoneme converter for Estonian ☆13 Updated 3 years ago
- ☆12 Updated 2 years ago
- StyleTTS 2 Optimized Training Fork ☆17 Updated this week
- ☆12 Updated 7 years ago
- Anonymous ICLR Submission ☆14 Updated 5 years ago
- Port of FunASR's Paraformer model in C/C++ ☆26 Updated 7 months ago
- ☆20 Updated 5 years ago
- Perform forced decoding with a target transcription ☆11 Updated 6 years ago
- PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant ☆10 Updated 5 years ago
- A Text2Speech Engine built in PyTorch. ☆11 Updated 6 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37 Updated 2 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi. ☆22 Updated 5 years ago
- This is part of the code for research on speech synthesis for a low-resourced language: Gan, a Chinese dialect spoken primarily in Jiangx… ☆17 Updated 8 years ago