HLTCHKUST / elderly_ser
Transferability of cross-lingual and cross-age speech emotion recognition
☆18Updated last year
Alternatives and similar repositories for elderly_ser:
Users that are interested in elderly_ser are comparing it to the libraries listed below
- ☆53Updated 7 months ago
- flow mirror models from JZX AI Labs☆42Updated 4 months ago
- The case study and multilingfual performance of ICASSP submission☆20Updated 2 years ago
- Official release of StyleTalk dataset.☆61Updated 7 months ago
- Huawei Grad-TTS for Chinese☆46Updated last year
- Official Code for ParrotTTS☆49Updated 4 months ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆28Updated last year
- ☆65Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 9 months ago
- 56 language, 1 model Multilingual ASR☆24Updated 3 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Updated 3 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- Awesome TTS☆55Updated 3 years ago
- English conversation corpus for conversational TTS.☆20Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 5 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 3 years ago
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆11Updated 2 months ago
- ☆12Updated last year
- paraformer(chinense asr) online onnx runtime for python☆40Updated 10 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year
- ☆33Updated 5 months ago
- one script for xls-r/xlsr/whisper fine-tuning☆40Updated last year
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆23Updated 6 months ago
- F5-TTS 推理加速,速度提升约4倍!☆42Updated last month
- ☆25Updated 2 years ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆67Updated 2 years ago
- [ACM MM 2023] Official PyTorch implementation of "Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Reco…☆12Updated last year
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆91Updated last month
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆47Updated 3 months ago