A collection of useful audio datasets and transforms for PyTorch.
☆144 · Feb 11, 2023 · Updated 3 years ago
Alternatives and similar repositories for audio-data-pytorch
Users who are interested in audio-data-pytorch compare it to the libraries listed below.
- Trainer for audio-diffusion-pytorch ☆129 · Jan 13, 2023 · Updated 3 years ago
- Audio generation using diffusion models, in PyTorch. ☆2,095 · Jun 12, 2023 · Updated 2 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently. ☆196 · Apr 27, 2023 · Updated 2 years ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating … ☆95 · Jun 12, 2025 · Updated 8 months ago
- Audio Dataset for training CLAP and other models ☆730 · Jan 8, 2026 · Updated last month
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022) ☆17 · Feb 16, 2023 · Updated 3 years ago
- Reproducible Subjective Evaluation ☆61 · Mar 3, 2024 · Updated 2 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch. ☆89 · Jun 12, 2023 · Updated 2 years ago
- Unconditional music synthesis using a diffusion model in the STFT domain ☆12 · May 31, 2022 · Updated 3 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆33 · Apr 22, 2024 · Updated last year
- open soundstream-ish VAE codecs for downstream neural audio synthesis ☆121 · Jun 12, 2023 · Updated 2 years ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆46 · May 16, 2025 · Updated 9 months ago
- ☆87 · Jan 29, 2023 · Updated 3 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT). ☆69 · Dec 9, 2022 · Updated 3 years ago
- Keep track of big models in audio domain, including speech, singing, music etc. ☆506 · Sep 26, 2024 · Updated last year
- Project for MIDI to Audio Synthesis ☆27 · Mar 13, 2023 · Updated 2 years ago
- A collection of audio autoencoders, in PyTorch. ☆44 · Mar 7, 2023 · Updated 2 years ago
- Collection of audio-focused loss functions in PyTorch ☆854 · Jul 30, 2024 · Updated last year
- Source Separation training codebase for the Sound Demixing Challenge 2023. ☆45 · May 18, 2023 · Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch. ☆115 · Jan 27, 2023 · Updated 3 years ago
- ☆87 · May 21, 2023 · Updated 2 years ago
- A lightweight library for Frechet Audio Distance calculation. ☆309 · Feb 11, 2026 · Updated 2 weeks ago
- A timeline of the latest AI models for audio generation, starting in 2023! ☆1,913 · Jan 4, 2024 · Updated 2 years ago
- ☆59 · May 31, 2023 · Updated 2 years ago
- ☆13 · Mar 11, 2025 · Updated 11 months ago
- Object-oriented handling of audio data, with GPU-powered augmentations, and more. ☆339 · Apr 1, 2025 · Updated 11 months ago
- Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images. ☆786 · Sep 25, 2024 · Updated last year
- Deep Performer: Score-to-audio music performance synthesis ☆44 · Jun 26, 2023 · Updated 2 years ago
- Asteroid's filterbanks ☆88 · Jan 12, 2025 · Updated last year
- Implementation of DiffWave and SaShiMi audio generation models ☆128 · Apr 4, 2023 · Updated 2 years ago
- ☆22 · Jun 8, 2021 · Updated 4 years ago
- music generation with masked transformers! ☆350 · May 16, 2025 · Updated 9 months ago
- ICASSP 2023 Accepted ☆190 · May 6, 2024 · Updated last year
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models ☆63 · Mar 19, 2023 · Updated 2 years ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ☆37 · Jun 24, 2025 · Updated 8 months ago
- Python library for audio augmentation ☆85 · Jul 6, 2023 · Updated 2 years ago
- ☆20 · May 23, 2024 · Updated last year
- A real time implementation of the ddsp from google magenta. ☆15 · Nov 8, 2021 · Updated 4 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆111 · Aug 29, 2024 · Updated last year