flageval-baai / SeniorTalk
A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors
☆12Updated last month
Alternatives and similar repositories for SeniorTalk
Users that are interested in SeniorTalk are comparing it to the libraries listed below
Sorting:
- Project of Singing Voice Conversion.☆14Updated last year
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆29Updated last month
- ☆20Updated 7 months ago
- noise reduction☆17Updated 10 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 8 months ago
- ☆10Updated 6 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆19Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆19Updated 3 months ago
- ☆13Updated 8 months ago
- CTC decoder with hotwords for ASR.☆20Updated last month
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆12Updated 8 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated 2 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- ☆12Updated 2 years ago
- ☆11Updated 2 years ago
- Cantonese Text to Speech with VITS implementation☆29Updated 2 years ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆16Updated last week
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated last month
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆17Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 6 months ago
- Forced alignment decoder for Whisper.☆14Updated last year
- ☆15Updated 2 months ago
- ☆34Updated 2 months ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- 将任意人的音色转换为成千上万种不同音色☆28Updated last year
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Updated last year
- Supervoice Speaker Separation Network☆12Updated 11 months ago
- Using OpenVINO to speed up MeloTTS inference☆11Updated 6 months ago
- silero-vad pytorch implement☆17Updated 5 months ago