GPUPhobia / vocal-maskLinks
β12Updated 6 years ago
Alternatives and similar repositories for vocal-mask
Users that are interested in vocal-mask are comparing it to the libraries listed below
Sorting:
- β75Updated 4 months ago
- π Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).β31Updated last year
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REPβ¦β33Updated last year
- Code base for WaveTransformer: A novel architecture for automated audio captioningβ44Updated 4 years ago
- Python library for audio augmentationβ85Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transferβ37Updated 3 years ago
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.ioβ51Updated 6 months ago
- Feature extractor for DL speech processing.β66Updated 3 years ago
- Score- and Lyrics-Free Singing Voice Generationβ28Updated 5 years ago
- Semi-supervised learning using teacher-student models for vocal melody extractionβ43Updated 4 years ago
- β24Updated 7 years ago
- β45Updated 3 years ago
- β32Updated 4 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samplesβ24Updated 5 years ago
- Deep Speech Distances PyTorchβ29Updated 3 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate Pβ¦β11Updated 3 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)β43Updated 6 years ago
- MelNet-Tensorflow implementationβ40Updated 5 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"β33Updated 3 years ago
- Ultrafast GAN based Vocoder for Text to Speechβ50Updated 3 years ago
- follow NVIDIA, simplify it and support data parallel.β13Updated 6 years ago
- β10Updated last year
- Keras implementation of musicnn, a set of pre-trained deep convolutional neural networks for music audio taggingβ27Updated 4 years ago
- β12Updated 8 years ago
- SiSEC MUS 2018 Submission Systemβ43Updated 6 years ago
- Official PyTorch implementation of TTS Style Transferβ25Updated 3 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Modelβ26Updated 2 years ago
- β18Updated 4 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsamplingβ37Updated 4 years ago
- Toolbox for easy and qualitative one-shot voice conversionβ46Updated 4 years ago