lostromb / pocketsphinx-kwsLinks
Pure C# port of the Pocketsphinx keyword spotter
☆13Updated 6 years ago
Alternatives and similar repositories for pocketsphinx-kws
Users that are interested in pocketsphinx-kws are comparing it to the libraries listed below
Sorting:
- ☆22Updated 4 years ago
- wake word spotting with kaldi☆19Updated 5 years ago
- ☆13Updated 4 years ago
- Perform the forced decoding with target transcription☆11Updated 7 years ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆23Updated 2 months ago
- Wenet speech to text for react native☆10Updated 3 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Updated last year
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Updated last year
- ☆17Updated 2 years ago
- Speech synthesis using LPC☆23Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆31Updated 4 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- ☆23Updated 10 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Updated 3 years ago
- Unsupervised speech activity detection system.☆11Updated 7 years ago
- ☆33Updated 4 years ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Updated 6 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated last year
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- Kanade is a speech tokenizer that encodes speech into compact content tokens and global embeddings and decodes them back to mel spectrogr…☆36Updated 2 weeks ago
- Speech Resynthesis and Language Modeling☆27Updated 7 months ago
- A SPMI Lab toolkit for language models.☆11Updated 8 years ago
- Phoneme alignment representation compatible with multiple forced aligners☆22Updated last year
- C code to extract mfcc or fbank features from wav files☆17Updated 6 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated 2 years ago
- ☆11Updated 2 years ago
- Project of Singing Voice Conversion.☆15Updated 2 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆14Updated 10 months ago