haoheliu / ssr_eval
Evaluation and Benchmarking of Speech Super-resolution Methods
☆148Updated 2 years ago
Alternatives and similar repositories for ssr_eval:
Users that are interested in ssr_eval are comparing it to the libraries listed below
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 2 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆103Updated 3 years ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆118Updated 2 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆117Updated 2 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆187Updated 2 years ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆96Updated 2 months ago
- Implementation of the AlignTTS☆76Updated last year
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆78Updated 2 years ago
- ☆122Updated 2 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- Single channel speech source separation by diffusion process (ICASSP 2023)☆100Updated last year
- ☆170Updated 2 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- ☆64Updated last year
- The official source code of UniAudio☆88Updated 11 months ago
- A pytroch implementation of the FB-MelGAN☆89Updated 4 years ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆60Updated 2 years ago
- Reference-aware automatic speech evaluation toolkit☆144Updated 3 months ago
- ☆87Updated 2 years ago
- A differentiable version of SPTK☆180Updated last week
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis☆87Updated 4 years ago
- Alignment files of LibriTTS.☆61Updated 5 years ago
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆134Updated 2 years ago
- MOS score prediction by fine-tuned wav2vec2.0 model☆155Updated 2 years ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆42Updated 4 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Updated 4 years ago
- Expressive Anechoic Recordings of Speech (EARS)☆153Updated 8 months ago