sky1456723 / Pytorch-MBNet

A PyTorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK

☆60

Alternatives and similar repositories for Pytorch-MBNet

Users who are interested in Pytorch-MBNet are comparing it to the repositories listed below.

dhimasryan / MOSA-Net-Cross-Domain
☆56 Updated last year
shaojinding / Adversarial-Many-to-Many-VC
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …
☆39 Updated 2 years ago
soumimaiti / speechlmscore_tool
☆32 Updated 8 months ago
nii-yamagishilab / mos-finetune-ssl
☆98 Updated 2 years ago
HuangZiliAndy / SSL_for_multitalker
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆30 Updated 2 years ago
JasonSWFu / VQscore
☆54 Updated 8 months ago
kan-bayashi / LibriTTSLabel
Alignment files of LibriTTS.
☆64 Updated 5 years ago
facebookresearch / Noresqa
This GitHub repo is for NeurIPS 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…
☆102 Updated 2 years ago
ttslr / python-MCD
☆49 Updated 5 years ago
NaoyukiKanda / LibriSpeechMix
☆36 Updated 4 years ago
haoheliu / torchsubband
PyTorch implementation of subband decomposition
☆92 Updated 3 years ago
khhungg / BSSE-SE
Boosting Self-Supervised Embeddings for Speech Enhancement
☆47 Updated 3 years ago
ericwudayi / SkipVQVC
An implementation of SkipVQVC with various settings.
☆75 Updated 5 years ago
AI-Unicamp / TTS-Objective-Metrics
Objective metrics used in several text-to-speech (TTS) papers.
☆49 Updated last month
b04901014 / UUVC
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆81 Updated 2 years ago
xcmyz / ConvTasNet4BasisMelGAN
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21 Updated 4 years ago
mutiann / speech_rankings
A CSRankings-like index for speech researchers
☆34 Updated 9 months ago
YangAi520 / APNet
☆34 Updated 2 years ago
zceng / LVCNet
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80 Updated 4 years ago
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48 Updated 7 months ago
thuhcsi / icassp2021-emotion-tts
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
☆34 Updated 2 years ago
rishikksh20 / multiband-hifigan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆43 Updated 4 years ago
MingjieChen / EasyVC
A toolkit for any-to-any encoder-decoder voice conversion systems
☆84 Updated last year
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45 Updated 3 years ago
unilight / sheet
Speech Human Evaluation Estimation Toolkit (SHEET)
☆93 Updated last month
mycrazycracy / speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45 Updated 6 years ago
Deepest-Project / AlignTTS
Implementation of AlignTTS
☆77 Updated 2 years ago
jinhan / tacotron2-gst
Tacotron2 with Global Style Tokens
☆64 Updated 6 years ago
rishikksh20 / Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
☆106 Updated 3 years ago
unilight / LDNet
Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"
☆65 Updated 3 years ago