sky1456723 / Pytorch-MBNet
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
☆61Updated 3 years ago
Alternatives and similar repositories for Pytorch-MBNet:
Users that are interested in Pytorch-MBNet are comparing it to the libraries listed below
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆62Updated 3 years ago
- ☆29Updated 2 months ago
- ☆51Updated 8 months ago
- ☆32Updated 3 years ago
- ☆45Updated 2 months ago
- Alignment files of LibriTTS.☆61Updated 4 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- ☆88Updated last year
- Objective metrics used in several text-to-speech (TTS) papers.☆48Updated 2 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆28Updated last year
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 2 weeks ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated last month
- A simple package for Guided source separation (GSS)☆114Updated 9 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆52Updated 3 months ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆39Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆41Updated 3 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆55Updated 5 months ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆49Updated last month
- ☆47Updated 4 years ago
- An implementation of SkipVQVC with various settings.☆75Updated 4 years ago
- ☆64Updated last year
- A toolkit for any-to-any encoder-decoder voice conversion systems☆83Updated last year
- ☆43Updated 2 years ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆77Updated 3 years ago
- ☆30Updated last year
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- Implementation of the AlignTTS☆76Updated last year