generalwave / spleeter.pytorch
Spleeter implementation in pytorch
☆26Updated 4 years ago
Alternatives and similar repositories for spleeter.pytorch
Users that are interested in spleeter.pytorch are comparing it to the libraries listed below
Sorting:
- Spleeter implementation in pytorch☆39Updated 2 years ago
- simple dnn based vad☆70Updated 6 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆91Updated last year
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆69Updated 4 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆149Updated 2 years ago
- PyTorch implementation of Tacotron and Tacotron2☆32Updated 2 years ago
- ☆65Updated last year
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 4 years ago
- TTS Text Analyzer☆32Updated last year
- 一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.☆21Updated 5 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆83Updated last year
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆62Updated 3 years ago
- Huawei Grad-TTS for Chinese☆50Updated last year
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆145Updated 3 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis☆72Updated 3 years ago
- Predict prosody labels for Chinese sentences.☆41Updated 2 years ago
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Updated last year
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆20Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆76Updated 4 months ago
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis☆87Updated 4 years ago
- ☆29Updated 4 years ago
- style token with tacotron2☆61Updated last year
- ☆43Updated 4 years ago
- ☆75Updated 3 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆99Updated 3 years ago
- WaveRNN Vocoder + TTS☆16Updated 4 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated last year
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆85Updated 2 years ago