haoheliu / voicefixer_main
General Speech Restoration
☆276Updated last year
Alternatives and similar repositories for voicefixer_main:
Users that are interested in voicefixer_main are comparing it to the libraries listed below
- HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks☆211Updated 3 years ago
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆211Updated 6 months ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆209Updated last year
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆348Updated 2 months ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆324Updated 2 years ago
- PPG-Based Voice Conversion☆330Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆342Updated 2 years ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆249Updated 8 months ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆232Updated last year
- Conformer-based Metric GAN for speech enhancement☆336Updated 8 months ago
- ☆168Updated 2 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆201Updated 4 years ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆202Updated 4 months ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆145Updated last year
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆156Updated 3 years ago
- Easy-to-Use Speech MOS predictors☆251Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆154Updated 2 years ago
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation☆555Updated 2 weeks ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆130Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods☆144Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆318Updated 5 months ago
- An Open-source Streaming High-fidelity Neural Audio Codec☆455Updated 2 months ago
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆313Updated last year
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Updated 2 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆144Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆234Updated 10 months ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆349Updated 5 months ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆235Updated 5 months ago
- Singing Voice Synthesis based on VITS, different from VISinger☆187Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆151Updated last year