HLTCHKUST / elderly_serLinks
Transferability of cross-lingual and cross-age speech emotion recognition
☆18Updated last year
Alternatives and similar repositories for elderly_ser
Users that are interested in elderly_ser are comparing it to the libraries listed below
Sorting:
- The case study and multilingfual performance of ICASSP submission☆24Updated 2 years ago
- flow mirror models from JZX AI Labs☆44Updated 8 months ago
- Huawei Grad-TTS for Chinese☆50Updated last year
- Official release of StyleTalk dataset.☆67Updated 11 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆37Updated 6 years ago
- ☆57Updated last year
- ☆65Updated last year
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆28Updated last year
- ☆38Updated 10 months ago
- Official Code for ParrotTTS☆51Updated 8 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆73Updated 2 years ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆25Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- English conversation corpus for conversational TTS.☆22Updated 2 years ago
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- ☆12Updated 2 years ago
- ☆25Updated 2 years ago
- ViSpeR: Multilingual Audio-Visual Speech Recognition☆40Updated 2 months ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆46Updated last year
- Curriculum Vitae of Quan Wang☆15Updated 2 weeks ago
- ☆66Updated 9 months ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.☆39Updated 11 months ago
- ☆13Updated last year
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆11Updated 6 months ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated last year
- ☆56Updated 2 years ago
- 单独维护的中文TTS☆35Updated 2 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆13Updated 6 months ago