ffxiong / uaspeechLinks
Baseline kaldi script for UA-SPEECH corpus
☆32Updated last year
Alternatives and similar repositories for uaspeech
Users that are interested in uaspeech are comparing it to the libraries listed below
Sorting:
- ☆37Updated 3 years ago
- ☆17Updated 2 years ago
- ☆19Updated last year
- ☆32Updated last year
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- Balanced Error Rate for Speaker Diarization☆33Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Updated last year
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 6 years ago
- ☆28Updated 4 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
- MSP-Podcast Challenge Baseline Code☆29Updated last year
- ☆31Updated last month
- ☆16Updated 6 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated last year
- ☆15Updated 4 years ago
- ☆26Updated last year
- ☆11Updated 2 years ago
- Speech (audio) subjective evaluation system☆42Updated 5 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 5 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆15Updated 4 years ago
- An evaluation toolkit for voice conversion models.☆42Updated 4 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Updated 2 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Updated 4 years ago
- ☆16Updated 4 years ago
- Speechflow for emotion recognition related information decomposition☆10Updated 4 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 5 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago