sakemin / demucs_batch-multigpu
[Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation
☆23 · Updated 2 years ago
Alternatives and similar repositories for demucs_batch-multigpu
Users who are interested in demucs_batch-multigpu are comparing it to the repositories listed below.
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆99 · Updated last year
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT ☆51 · Updated 2 years ago
- million song dataset split for extended clean tag & artist-level stratified ☆52 · Updated 2 years ago
- ☆45 · Updated last year
- Frechet Audio Distance evaluation in PyTorch ☆36 · Updated 2 years ago
- Unofficial implementation of JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models (https://arxiv.org/abs/2308.… ☆54 · Updated last year
- AudioSR-Upsampling (any -> 48kHz) ☆42 · Updated last year
- Source Separation training codebase for the Sound Demixing Challenge 2023. ☆44 · Updated 2 years ago
- ☆74 · Updated last year
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24] ☆42 · Updated last year
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks ☆45 · Updated 6 months ago
- Audiogen Codec ☆144 · Updated last year
- music semantic understanding evaluation benchmark ☆25 · Updated 2 years ago
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding ☆53 · Updated 2 months ago
- ☆60 · Updated 2 years ago
- A DDSP-based neural voice synthesiser. ☆124 · Updated last year
- Prosody and Pronunciation Modification Network ☆60 · Updated 7 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating … ☆92 · Updated 6 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆71 · Updated 3 years ago
- Official implementation for FlowSep ☆68 · Updated 11 months ago
- ☆64 · Updated 5 months ago
- ☆111 · Updated 3 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model ☆54 · Updated 2 years ago
- ☆86 · Updated 2 years ago
- PyTorch Dataset for Speech and Music audio ☆79 · Updated last year
- Full models and training code for PESTO ☆71 · Updated last year
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting. ☆118 · Updated 4 months ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source… ☆44 · Updated 7 months ago
- AQUA-Tk = Audio QUality Assessment-Toolkit. (In development) ☆102 · Updated 2 weeks ago
- Landing Page for All Things Source Separation ☆35 · Updated 3 months ago