archinetai / audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.
☆139Updated 2 years ago
Alternatives and similar repositories for audio-data-pytorch:
Users that are interested in audio-data-pytorch are comparing it to the libraries listed below
- Pitch Estimating Neural Networks (PENN)☆249Updated 2 weeks ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆95Updated 8 months ago
- Encode and decode audio samples to/from compressed latent representations!☆192Updated last month
- Self-supervised learning for fast pitch estimation☆216Updated last month
- Audiogen Codec☆134Updated 9 months ago
- A simple library for Fréchet Audio Distance (FAD) calculation☆199Updated this week
- ☆164Updated last year
- A DDSP-based neural voice synthesiser.☆115Updated 5 months ago
- Trainer for audio-diffusion-pytorch☆129Updated 2 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year
- open soundstream-ish VAE codecs for downstream neural audio synthesis☆117Updated last year
- PyTorch wrappers for using your model in audacity!☆174Updated last year
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆113Updated last year
- Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)☆112Updated 4 months ago
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆269Updated 2 weeks ago
- ☆84Updated last year
- (ML) audio engineering i/o utils☆54Updated 2 weeks ago
- Official implementation of SawSing (ISMIR'22)☆260Updated 2 years ago
- ☆81Updated 2 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆85Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆147Updated 2 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆209Updated 3 weeks ago
- Pytorch implementation of BigVSAN☆204Updated last year
- Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.…☆51Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆64Updated last year
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆115Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch.☆113Updated 2 years ago
- Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset☆80Updated last year
- ☆197Updated last year
- ☆43Updated 10 months ago