rom1504 / audio2dataset
Easily turn large sets of audio URLs into an audio dataset.
☆21Updated 3 years ago
Alternatives and similar repositories for audio2dataset
Users that are interested in audio2dataset are comparing it to the libraries listed below
- Voice swapping with VQ-VAE and diffusion models☆68Updated 4 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- ☆32Updated 3 years ago
- ☆23Updated 2 years ago
- Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fit. It handles checkpo…☆118Updated last year
- Fast and differentiable hidden Markov model in C++☆19Updated 3 years ago
- The demo page of UniAudio☆34Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated this week
- Implementation of Perceiver AR, DeepMind's new long-context attention network based on the Perceiver architecture, in PyTorch☆94Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch.☆115Updated 3 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the…☆47Updated 3 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 5 years ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion.☆32Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆90Updated last year
- Demo for 2022 ICASSP☆64Updated 3 years ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Updated last year
- Implementation of "Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling"☆19Updated 3 months ago
- Finally, some decent sample sentences☆23Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36Updated last year
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Updated 4 years ago
- Official PyTorch implementation of TTS Style Transfer☆25Updated 3 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆15Updated 3 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆195Updated 2 years ago
- Implementation of a holodeck, written in PyTorch☆18Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆111Updated 2 years ago
- Song Describer is a data collection platform for annotating music with textual descriptions.☆60Updated last year
- ☆15Updated 3 years ago
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆27Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆37Updated 7 months ago