generalwave / spleeter.pytorchLinks
Spleeter implementation in pytorch
☆26Updated 5 years ago
Alternatives and similar repositories for spleeter.pytorch
Users that are interested in spleeter.pytorch are comparing it to the libraries listed below
Sorting:
- simple dnn based vad☆70Updated 7 years ago
- The Implementation of FastSpeech2 Based on Pytorch.☆52Updated 2 years ago
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Updated 2 years ago
- ☆40Updated 4 years ago
- ☆33Updated 4 years ago
- Spleeter implementation in pytorch☆39Updated 3 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Updated 4 years ago
- Huawei Grad-TTS for Chinese☆50Updated 2 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆94Updated 2 years ago
- WaveRNN Vocoder + TTS☆16Updated 5 years ago
- Efficient neural speech synthesis☆81Updated 5 years ago
- LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK. ICASSP2020☆54Updated 2 years ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 5 years ago
- A unofficial Pytorch implementation of Google's VoiceFilter☆103Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 4 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 7 years ago
- ☆61Updated 2 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Updated 5 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Updated 3 years ago
- style token with tacotron2☆62Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84Updated 2 years ago
- PyTorch reimplementation of Tacotron2 in Mandarin☆83Updated 4 years ago
- ☆68Updated 2 years ago
- using microphone☆17Updated 4 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆74Updated 6 years ago
- ☆45Updated 5 years ago
- A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.☆79Updated 3 years ago
- A demo of android key word spoting based on tensorflow tutial example☆28Updated 5 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆100Updated 4 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Updated 4 years ago