rom1504 / audio2datasetLinks
Easily turn large sets of audio urls to an audio dataset.
☆21Updated 2 years ago
Alternatives and similar repositories for audio2dataset
Users that are interested in audio2dataset are comparing it to the libraries listed below
Sorting:
- Voice swapping with VQ-VAE and diffusion models☆67Updated 4 years ago
- The demo page of UniAudio☆34Updated last year
- ☆22Updated 2 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- Song Describer is a data collection platform for annotating music with textual descriptions.☆60Updated 11 months ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆194Updated 2 years ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpo…☆116Updated last year
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 9 months ago
- A collection of pre-trained audio models, in PyTorch.☆113Updated 2 years ago
- Generate accompaniment part with chords using Evolutionary algorithm.☆10Updated 3 years ago
- ☆106Updated 2 years ago
- Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)☆121Updated 11 months ago
- ☆32Updated 3 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆15Updated 3 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆60Updated last year
- Official source codes of airsep☆38Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated last week
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆88Updated last year
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆25Updated last year
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆27Updated 2 years ago
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Updated last year
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year
- Trainer for audio-diffusion-pytorch☆129Updated 2 years ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆43Updated last year
- ☆72Updated 2 months ago
- ☆35Updated 3 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆92Updated 2 years ago
- Fork of AudioLDM as a TuneFlow plugin☆41Updated 2 years ago