ixxan / ug-speechLinks
☆12Updated 7 months ago
Alternatives and similar repositories for ug-speech
Users that are interested in ug-speech are comparing it to the libraries listed below
Sorting:
- Auto-KWS 2021 Challenge 1st place solution.☆11Updated 4 years ago
- ☆13Updated 4 years ago
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆37Updated 4 months ago
- End-to-End Speech Processing Toolkit☆9Updated 8 months ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆54Updated 2 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆17Updated 3 years ago
- Went online decode demo☆31Updated 4 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Updated 4 years ago
- VoxLingua107 recipe for SpeechBrain☆13Updated 4 years ago
- ☆29Updated 3 years ago
- The case study and multilingfual performance of ICASSP submission☆24Updated 2 years ago
- How to use our public wav2vec2 age and gender model☆48Updated last year
- ☆33Updated 3 years ago
- Huawei Grad-TTS for Chinese☆51Updated last year
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆20Updated 11 months ago
- faster inference☆28Updated 6 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated 2 years ago
- ☆44Updated last year
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆53Updated 3 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆83Updated 11 months ago
- Target speaker automatic speech recognition (TS-ASR)☆11Updated last year
- Vox-Profile Benchmark☆42Updated this week
- TransferTTS (Zero-Shot learning of VITS)☆100Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆86Updated 2 years ago
- ☆43Updated 2 years ago
- ☆31Updated 3 years ago
- ☆38Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆43Updated 4 years ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago