laic / uoe_speech_processing_course
☆34 · Updated 2 months ago
Alternatives and similar repositories for uoe_speech_processing_course
Users who are interested in uoe_speech_processing_course are comparing it to the libraries listed below.
- A list of papers for child ASR ☆50 · Updated last year
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆36 · Updated last year
- ☆61 · Updated last year
- ☆57 · Updated last year
- CMU multilingual speech repository ☆30 · Updated 3 years ago
- Alignment files of LibriTTS. ☆65 · Updated 5 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆92 · Updated last year
- Reference Implementations of Waveform Evaluation Networks (WEnets) ☆25 · Updated 2 years ago
- Yin pitch estimator in PyTorch ☆117 · Updated 3 years ago
- A PyTorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK ☆61 · Updated 4 years ago
- Implementation of audio degradation processes ☆105 · Updated 10 years ago
- ☆107 · Updated 2 years ago
- Support for Clarity Enhancement and Prediction Challenges (obsolete - see README) ☆48 · Updated 3 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis" ☆191 · Updated 3 years ago
- A system for singing voice synthesis ☆79 · Updated 2 years ago
- Python wrappers for Kaldi's Levenshtein distance and alignment code. ☆68 · Updated 7 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations ☆121 · Updated last year
- PyTorch implementation of subband decomposition ☆92 · Updated 3 years ago
- ☆40 · Updated 3 years ago
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training ☆41 · Updated 5 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS ☆31 · Updated 2 years ago
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆128 · Updated 2 months ago
- Deep Articulatory Synthesis and Inversion ☆54 · Updated last year
- MOS score prediction by fine-tuned wav2vec2.0 model ☆173 · Updated 3 years ago
- Expressive Anechoic Recordings of Speech (EARS) ☆202 · Updated last year
- Transcripts and segmentation for the Blizzard 2013 audiobooks, also known as the Lessac or Blizzard 2013 dataset. ☆44 · Updated 6 years ago
- An open-source platform for browser-based speech and audio subjective quality tests. ☆37 · Updated last month
- Keras-based Python framework to compute phonological posterior probabilities from audio files ☆45 · Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 · Updated 3 years ago
- Python implementation of the SRMR toolbox ☆125 · Updated last year