kuleshov / audio-super-res
Audio super resolution using neural networks
☆1,184 Updated last year
Related projects
Alternatives and complementary repositories for audio-super-res
- Implementation of the Wave-U-Net for audio source separation ☆844 Updated last year
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch ☆1,562 Updated 6 months ago
- A neural network for end-to-end speech denoising ☆677 Updated last year
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss ☆1,000 Updated 2 weeks ago
- Open-Unmix - Music Source Separation for PyTorch ☆1,274 Updated 4 months ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2) ☆638 Updated 4 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t… ☆854 Updated last year
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis ☆972 Updated last year
- Deep Convolutional Neural Networks for Musical Source Separation ☆471 Updated 4 years ago
- An implementation of "Neural Voice Cloning With Few Samples" ☆429 Updated 3 years ago
- Deep learning for audio denoising ☆653 Updated last year
- General Speech Restoration ☆1,035 Updated 5 months ago
- The PyTorch-based audio source separation toolkit for researchers ☆2,269 Updated 3 months ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck ☆645 Updated 2 weeks ago
- Deep neural networks for separating singing voice from music written in TensorFlow ☆797 Updated 5 years ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech E… ☆1,677 Updated last year
- A flexible source separation library in Python ☆620 Updated last year
- A neural network for end-to-end music source separation ☆225 Updated 4 years ago
- TensorFlow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support. ☆583 Updated last year
- Audio style transfer with shallow random parameters CNN. ☆404 Updated last year
- WaveRNN Vocoder + TTS ☆2,140 Updated 2 years ago
- Speech Enhancement Generative Adversarial Network in TensorFlow ☆815 Updated last year
- Collection of audio-focused loss functions in PyTorch ☆738 Updated 3 months ago
- Neural network-based singing voice synthesis library for research ☆689 Updated last year
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,282 Updated 5 months ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR ☆903 Updated last year
- CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Has been designed for large scale gender … ☆753 Updated this week
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆1,952 Updated 3 months ago
- CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018) ☆1,118 Updated 2 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022 ☆278 Updated last year