A collection of useful audio datasets and transforms for PyTorch.
☆144Feb 11, 2023Updated 3 years ago
Alternatives and similar repositories for audio-data-pytorch
Users that are interested in audio-data-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Trainer for audio-diffusion-pytorch☆129Jan 13, 2023Updated 3 years ago
- A collection of audio autoencoders, in PyTorch.☆44Mar 7, 2023Updated 3 years ago
- Audio generation using diffusion models, in PyTorch.☆2,096Jun 12, 2023Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch.☆116Jan 27, 2023Updated 3 years ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆96Jun 12, 2025Updated 9 months ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Apr 27, 2023Updated 2 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆89Jun 12, 2023Updated 2 years ago
- Audio Dataset for training CLAP and other models☆731Jan 8, 2026Updated 2 months ago
- ☆87Jan 29, 2023Updated 3 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆17Feb 16, 2023Updated 3 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆33Apr 22, 2024Updated last year
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆72Dec 9, 2022Updated 3 years ago
- open soundstream-ish VAE codecs for downstream neural audio synthesis☆123Jun 12, 2023Updated 2 years ago
- ☆12Mar 11, 2025Updated last year
- Keep track of big models in audio domain, including speech, singing, music etc.☆506Sep 26, 2024Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆47May 16, 2025Updated 10 months ago
- Reproducible Subjective Evaluation☆61Mar 3, 2024Updated 2 years ago
- A lightweight library for Frechet Audio Distance calculation.☆312Feb 11, 2026Updated last month
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 3 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆45May 18, 2023Updated 2 years ago
- A timeline of the latest AI models for audio generation, starting in 2023!☆1,914Jan 4, 2024Updated 2 years ago
- ☆87May 21, 2023Updated 2 years ago
- alchemy with embeddings☆34Jun 14, 2023Updated 2 years ago
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆339Apr 1, 2025Updated 11 months ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- Collection of audio-focused loss functions in PyTorch☆856Jul 30, 2024Updated last year
- ☆22Jun 8, 2021Updated 4 years ago
- Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.☆789Sep 25, 2024Updated last year
- Paderbox: A collection of utilities for audio / speech processing☆43Jul 21, 2025Updated 8 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆37Jun 24, 2025Updated 8 months ago
- Implementation of DiffWave and SaShiMi audio generation models☆128Apr 4, 2023Updated 2 years ago
- Deep Performer: Score-to-audio music performance synthesis☆44Jun 26, 2023Updated 2 years ago
- ☆20May 23, 2024Updated last year
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆21Sep 7, 2025Updated 6 months ago
- music generation with masked transformers!☆351May 16, 2025Updated 10 months ago
- ☆59May 31, 2023Updated 2 years ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆104Mar 19, 2024Updated 2 years ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆63Mar 19, 2023Updated 3 years ago