JuanPZuluaga / accent-recog-slt2022Links
Repository for Accent Recognition (Hackathon @SLT2022)
☆33Updated last year
Alternatives and similar repositories for accent-recog-slt2022
Users that are interested in accent-recog-slt2022 are comparing it to the libraries listed below
Sorting:
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆23Updated 9 months ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆31Updated 2 years ago
- ☆80Updated 3 weeks ago
- ☆56Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool☆21Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆23Updated 6 months ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- A list of papers for child ASR☆46Updated 10 months ago
- ☆25Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 2 years ago
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆30Updated last year
- ☆64Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆32Updated 2 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Updated last year
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆29Updated last year
- ☆32Updated 9 months ago
- A sequence-to-sequence voice conversion toolkit.☆102Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆56Updated 6 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆34Updated 11 months ago
- multilingual speech aligner☆76Updated last year
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆51Updated last year
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆91Updated 5 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆63Updated last month
- ☆13Updated 9 months ago
- ☆54Updated last year
- This is the M-AILABS Speech Dataset☆78Updated 9 months ago