archinetai / audio-data-pytorchLinks
A collection of useful audio datasets and transforms for PyTorch.
☆141Updated 2 years ago
Alternatives and similar repositories for audio-data-pytorch
Users that are interested in audio-data-pytorch are comparing it to the libraries listed below
Sorting:
- Audiogen Codec☆143Updated last year
- Self-supervised learning for real-time pitch estimation☆260Updated 3 weeks ago
- Pitch Estimating Neural Networks (PENN)☆268Updated 7 months ago
- A DDSP-based neural voice synthesiser.☆122Updated 11 months ago
- ☆177Updated 2 years ago
- ☆85Updated 2 years ago
- Encode and decode audio samples to/from compressed latent representations!☆239Updated last month
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆99Updated last year
- ☆235Updated last year
- A simple library for Fréchet Audio Distance (FAD) calculation☆235Updated 2 months ago
- Official implementation of SawSing (ISMIR'22)☆269Updated 3 years ago
- ☆169Updated 2 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆120Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆70Updated 2 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year
- PyTorch Dataset for Speech and Music audio☆78Updated last year
- Trainer for audio-diffusion-pytorch☆129Updated 2 years ago
- open soundstream-ish VAE codecs for downstream neural audio synthesis☆117Updated 2 years ago
- Results and Models for Learning Audio Representations of Music Content☆104Updated 11 months ago
- Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)☆121Updated 11 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆89Updated 5 months ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆88Updated 2 years ago
- Full models and training code for PESTO☆71Updated last year
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆160Updated 3 years ago
- Teaching material for the course "Deep Learning for Music Analysis and Generation" I taught at National Taiwan University☆229Updated this week
- ☆86Updated 2 years ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆114Updated 2 years ago
- Perform transfer learning for MIR using Jukebox!☆183Updated 2 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆209Updated 3 years ago
- PyTorch wrappers for using your model in audacity!☆179Updated 2 years ago