ixxan / ug-speechLinks
☆12Updated 8 months ago
Alternatives and similar repositories for ug-speech
Users that are interested in ug-speech are comparing it to the libraries listed below
Sorting:
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆17Updated 3 years ago
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆39Updated 5 months ago
- ☆13Updated 4 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆53Updated last week
- Keyword spotting for audio with attention (KWS model for audio)☆18Updated 4 years ago
- Auto-KWS 2021 Challenge 1st place solution.☆11Updated 4 years ago
- Text frontend for ESPnet tts recipes☆33Updated 4 years ago
- kaldi cnn-tdnnf baseline☆13Updated 4 years ago
- Went online decode demo☆31Updated 4 years ago
- ☆61Updated 2 years ago
- ☆33Updated 3 years ago
- The case study and multilingfual performance of ICASSP submission☆24Updated 2 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆36Updated 7 years ago
- vad wraper on webrtcvad☆24Updated 8 years ago
- ☆29Updated 3 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 4 years ago
- ☆40Updated 4 years ago
- a kws demo on android☆39Updated last year
- ☆44Updated 4 years ago
- TransferTTS (Zero-Shot learning of VITS)☆100Updated 2 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Updated 2 years ago
- MagicData-RAMC Dataset and Baseline☆54Updated 2 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆110Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆84Updated last year
- simple dnn based vad☆70Updated 6 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆42Updated 2 years ago
- ☆33Updated 4 years ago
- ☆66Updated 2 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆35Updated 3 months ago
- 56 language, 1 model Multilingual ASR☆25Updated 4 years ago