ffxiong / uaspeech
Baseline Kaldi script for the UA-SPEECH corpus
☆30 Updated 5 months ago
Alternatives and similar repositories for uaspeech:
Users who are interested in uaspeech are comparing it to the libraries listed below:
- Code for Deep Self-Supervised Hierarchical Clustering for Speaker Diarization ☆17 Updated 3 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion" ☆35 Updated 5 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by … ☆39 Updated last year
- A PyTorch implementation of MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network ☆60 Updated 3 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P… ☆36 Updated last year
- This repository contains the code for our upcoming paper "An Investigation of End-to-End Models for Robust Speech Recognition" at ICASSP 20… ☆47 Updated 2 months ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech ☆17 Updated last year
- PyTorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder" ☆28 Updated 4 years ago
- Speech (audio) subjective evaluation system ☆38 Updated 4 years ago
- Convert WSJ sphere format to waveform and do data simulation. ☆16 Updated 5 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆15 Updated 5 months ago
- A PyTorch implementation of Speech Transformer with multi-GPU support, an End-to-End ASR with Transformer network on Mandarin Chinese. This code… ☆10 Updated 5 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 2 years ago
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training ☆39 Updated 4 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition" ☆16 Updated 3 years ago
- Multi-task speech classification of the accent and gender of English speakers on Mozilla's Common Voice dataset ☆26 Updated 6 months ago
- acnn for text-independent speaker recognition ☆9 Updated 3 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation ☆41 Updated 2 years ago
- Balanced Error Rate for Speaker Diarization ☆30 Updated 2 years ago