AppleHolic / pytorch_sound
Sound Related Deep Learning Tasks boosting repository with pytorch
☆87Updated 9 months ago
Alternatives and similar repositories for pytorch_sound:
Users that are interested in pytorch_sound are comparing it to the libraries listed below
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated 2 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆52Updated 5 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- ☆34Updated 5 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- A pytorch implementation of FFTNet.☆37Updated 6 years ago
- VQVAE for Unsupervised Voice Conversion☆21Updated 6 years ago
- A test bed for updates and new features | pytorch/audio☆169Updated 4 years ago
- A pytroch implementation of the FB-MelGAN☆89Updated 4 years ago
- A Pytorch Implementation of MelGAN☆67Updated 5 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 4 years ago
- ☆42Updated 6 years ago
- 2018/2019 TTS framework integrating state of the art open source methods☆47Updated 5 years ago
- WaveGlow vocoder with VQVAE☆61Updated 5 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- The pytorch implementation of DC-TTS☆76Updated 6 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Updated 4 years ago
- A fast cnn-based vocoder☆78Updated 4 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- Benchmark popular audio i/o packages☆140Updated last year
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 5 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆84Updated 6 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆34Updated 7 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- Asteroid's filterbanks☆84Updated 3 months ago
- Implementation of the AlignTTS☆76Updated last year
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆54Updated last year