rom1504 / audio2datasetLinks
Easily turn large sets of audio urls to an audio dataset.
☆21Updated 2 years ago
Alternatives and similar repositories for audio2dataset
Users that are interested in audio2dataset are comparing it to the libraries listed below
Sorting:
- Voice swapping with VQ-VAE and diffusion models☆67Updated 3 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- ☆22Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated this week
- ☆32Updated 3 years ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpo…☆116Updated last year
- ☆106Updated last year
- A collection of pre-trained audio models, in PyTorch.☆113Updated 2 years ago
- Song Describer is a data collection platform for annotating music with textual descriptions.☆60Updated 10 months ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆194Updated 2 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆15Updated 3 years ago
- Fork of AudioLDM as a TuneFlow plugin☆41Updated 2 years ago
- Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)☆120Updated 10 months ago
- Implementation of a holodeck, written in Pytorch☆18Updated last year
- ☆51Updated 11 months ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆11Updated 8 months ago
- App to explore latent spaces of music collections☆35Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆60Updated last year
- Generate accompaniment part with chords using Evolutionary algorithm.☆10Updated 3 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆25Updated last year
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆88Updated last year
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆91Updated 2 years ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆25Updated last year
- Demo for 2022 ICASSP☆64Updated 3 years ago
- ☆12Updated 5 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Updated last year