rom1504 / audio2dataset
Easily turn large sets of audio URLs into an audio dataset.
☆20 Updated 2 years ago
Alternatives and similar repositories for audio2dataset:
Users who are interested in audio2dataset are comparing it to the libraries listed below.
- ☆23 Updated last year
- [DEPRECATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume… ☆15 Updated last year
- GroupMap: beyond mean and variance matching for deep learning ☆10 Updated 2 years ago
- Generate an accompaniment part with chords using an evolutionary algorithm. ☆9 Updated 2 years ago
- ☆15 Updated 2 years ago
- Song Describer is a data collection platform for annotating music with textual descriptions. ☆57 Updated 3 months ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ☆14 Updated 4 years ago
- SDX23 startkit for the Demucs baselines. ☆27 Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆18 Updated last month
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation ☆11 Updated 7 months ago
- text-to-audio-latent-diffusion ☆37 Updated last year
- Fast and differentiable hidden Markov model in C++ ☆17 Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- ☆20 Updated 3 years ago
- Solos: A Dataset for Audio-Visual Music Analysis ☆21 Updated 2 years ago
- A fast Python library for aligning similar audio snippets passed in as NumPy arrays ☆44 Updated 6 months ago
- ☆32 Updated 3 years ago
- Contrastive Language-Audio Pretraining ☆15 Updated 3 years ago
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP] ☆23 Updated 10 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ☆34 Updated 6 months ago
- ☆32 Updated 4 years ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion. ☆31 Updated last year
- Frechet Audio Distance evaluation in PyTorch ☆35 Updated last year
- Deep Performer: Score-to-audio music performance synthesis ☆43 Updated last year
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music ☆24 Updated last year
- Steer OpenAI's Jukebox with Music Taggers ☆42 Updated 2 years ago
- Finally, some decent sample sentences ☆22 Updated last year
- Based on https://github.com/fatchord/WaveRNN ☆24 Updated 4 years ago
- ☆31 Updated 2 years ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS) ☆22 Updated 3 years ago