lostromb / pocketsphinx-kws
Pure C# port of the Pocketsphinx keyword spotter
☆12Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for pocketsphinx-kws
- Perform the forced decoding with target transcription☆11Updated 6 years ago
- Chinese Mandarin Synthesis Corpus-Female/Emotional☆11Updated 3 months ago
- ☆11Updated last year
- Hifi-like Vocoder implemented in PyTorch☆13Updated 2 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- End-to-End SpeechSynthesis system with fastspeech2 & hifigan☆13Updated 2 years ago
- Unsupervised speech activity detection system.☆11Updated 6 years ago
- Tensorflow implementation of pix2pix(cGAN) for audio source separation☆15Updated 6 years ago
- ☆10Updated last year
- (Si)mply a (Re)search front-end for Text-To-Speech Synthesis.☆10Updated 6 years ago
- This is a TTS model based on VITS that can control the output speech emotion through natural language and control the speaker through ref…☆5Updated 3 months ago
- Piper TTS Integration using C#☆13Updated 2 months ago
- python wrap for hts engine☆14Updated 6 years ago
- ☆10Updated 2 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 2 months ago
- ☆11Updated 3 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated 10 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- ☆13Updated 2 months ago
- with alignment learning and continuous wavelet transform☆19Updated 2 years ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆14Updated 3 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- tensorflow speech synthesis c++ inference for voicenet☆16Updated 5 years ago
- ☆9Updated last month
- Multispeaker Community Vocoder Model for DiffSinger☆35Updated 6 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆14Updated this week
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆14Updated 4 years ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆22Updated 10 months ago