sakemin / demucs_batch-multigpuLinks
[Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation
☆23Updated 2 years ago
Alternatives and similar repositories for demucs_batch-multigpu
Users that are interested in demucs_batch-multigpu are comparing it to the libraries listed below
Sorting:
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆100Updated last year
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT☆52Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆42Updated last year
- million song dataset split for extended clean tag & artist-level stratified☆52Updated 2 years ago
- ☆45Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Updated 3 years ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆114Updated 2 years ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆45Updated 7 months ago
- ☆60Updated 2 years ago
- ☆83Updated 2 years ago
- ☆74Updated last year
- Frechet Audio Distance evaluation in PyTorch☆36Updated 2 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆57Updated 2 years ago
- PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model☆80Updated 2 years ago
- Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.…☆54Updated last year
- Audiogen Codec☆144Updated last year
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆60Updated this week
- ☆65Updated 6 months ago
- Prosody and Pronunciation Modification Network☆60Updated 8 months ago
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆24Updated 4 years ago
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding☆55Updated 3 months ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆44Updated 2 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆23Updated 2 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 3 years ago
- Reproducible Subjective Evaluation☆60Updated last year
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆89Updated last month
- Landing Page for All Things Source Separation☆35Updated 4 months ago
- ☆85Updated 2 years ago
- Training, validation, and inference code for various SSL approaches and architectures.☆73Updated 2 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆92Updated 7 months ago