haoheliu / voicefixer
General Speech Restoration
☆1,254 Updated 10 months ago
Alternatives and similar repositories for voicefixer
Users who are interested in voicefixer are comparing it to the libraries listed below:
- Versatile audio super resolution (any -> 48kHz) with AudioSR. ☆1,683 Updated 3 months ago
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis ☆1,028 Updated last year
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion ☆697 Updated 11 months ago
- Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs ☆696 Updated 3 months ago
- Official PyTorch implementation of BigVGAN (ICLR 2023) ☆1,158 Updated last year
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone ☆1,042 Updated last year
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio. ☆1,662 Updated 2 weeks ago
- Official Implementation of StyleTTS ☆456 Updated 11 months ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech E… ☆1,853 Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆2,279 Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆897 Updated last year
- AI powered speech denoising and enhancement ☆2,112 Updated last year
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch ☆668 Updated last year
- Model for MDX23 music separation contest ☆811 Updated 8 months ago
- Voice Conversion With Just Nearest Neighbors ☆506 Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆411 Updated last year
- Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch ☆1,333 Updated 2 years ago
- Official implementation of "Separate Anything You Describe" ☆1,853 Updated last year
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design ☆626 Updated 2 years ago
- General Speech Restoration ☆283 Updated last year
- ☆292 Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR ☆1,026 Updated 2 years ago
- unofficial vits2-TTS implementation in pytorch ☆543 Updated last year
- 🐸 collection of TTS papers ☆719 Updated last year
- A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (… ☆457 Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022 ☆301 Updated 2 years ago
- The code for the bark-voicecloning model. Training and inference. ☆710 Updated 2 years ago
- Pytorch implementation of the CREPE pitch tracker ☆495 Updated 7 months ago
- Repository for training models for music source separation. ☆1,072 Updated 2 weeks ago
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation ☆694 Updated 4 months ago