ffxiong / uaspeechLinks
Baseline kaldi script for UA-SPEECH corpus
☆31Updated 11 months ago
Alternatives and similar repositories for uaspeech
Users that are interested in uaspeech are comparing it to the libraries listed below
Sorting:
- ☆37Updated 3 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated 11 months ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆48Updated 8 months ago
- ☆15Updated 4 years ago
- ☆14Updated 3 years ago
- ☆11Updated last year
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- Speech (audio) subjective evaluation system☆41Updated 5 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆32Updated 9 months ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 4 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Updated 4 years ago
- ☆25Updated 3 weeks ago
- ☆52Updated 4 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 5 years ago
- ☆27Updated 3 years ago
- ☆19Updated last year
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Updated last year
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- ☆16Updated 6 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Updated 4 years ago
- ☆32Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year