rom1504 / audio2dataset
Easily turn large sets of audio URLs into an audio dataset.
☆21 Updated 2 years ago
Alternatives and similar repositories for audio2dataset:
Users interested in audio2dataset are comparing it to the libraries listed below:
- ☆23 Updated last year
- [DEPRECATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume… ☆15 Updated last year
- [DEPRECATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti… ☆31 Updated last year
- text-to-audio-latent-diffusion ☆37 Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆18 Updated this week
- Fork of AudioLDM as a TuneFlow plugin ☆39 Updated last year
- iSeparate library for the SDX2023 challenge ☆13 Updated last year
- Song Describer is a data collection platform for annotating music with textual descriptions. ☆57 Updated 3 months ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec" ☆12 Updated last month
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- Generate an accompaniment part with chords using an evolutionary algorithm. ☆9 Updated 2 years ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion. ☆31 Updated last year
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music ☆24 Updated last year
- GroupMap: beyond mean and variance matching for deep learning ☆10 Updated 2 years ago
- ☆15 Updated 2 years ago
- SDX23 startkit for the Demucs baselines. ☆27 Updated 2 years ago
- A fast Python library for aligning similar audio snippets passed in as NumPy arrays ☆44 Updated this week
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation ☆11 Updated 7 months ago
- Codebase for the ICLR '23 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval" ☆35 Updated last year
- ☆40 Updated 4 months ago
- ☆11 Updated last year
- Companion repository which facilitates the creation of Gradio endpoints which are accessible from within Digital Audio Workstations (DAWs… ☆26 Updated last month
- A speech signal processing library in Python with emphasis on deep learning. ☆31 Updated 2 years ago
- Deep Performer: Score-to-audio music performance synthesis ☆43 Updated last year
- Voice swapping with VQ-VAE and diffusion models ☆67 Updated 3 years ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing ☆21 Updated 6 months ago
- Solos: A Dataset for Audio-Visual Music Analysis ☆21 Updated 2 years ago
- Demo for ICASSP 2022 ☆64 Updated 2 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model" ☆14 Updated 2 years ago
- GPT for FACodec ☆13 Updated 11 months ago