Easily turn large sets of audio urls to an audio dataset.
☆21Dec 27, 2022Updated 3 years ago
Alternatives and similar repositories for audio2dataset
Users that are interested in audio2dataset are comparing it to the libraries listed below
Sorting:
- NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool.☆18Apr 7, 2025Updated 10 months ago
- Short-time Fourier transform (STFT) for JAX☆15Dec 20, 2021Updated 4 years ago
- creating audio preprocessing features in TensorFlow keras layers,☆14Jul 13, 2021Updated 4 years ago
- Repo accompanying the blog post "How to Deploy PyTorch Models with Core ML Conversion Issues"☆14Jul 3, 2020Updated 5 years ago
- Converts stable diffusion embeddings to loadable pngs☆40Dec 6, 2022Updated 3 years ago
- Contains the thorough experiments made for a FloydHub article on Anomaly Detection☆16Oct 3, 2020Updated 5 years ago
- ☆15Apr 26, 2022Updated 3 years ago
- Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals.☆22Jul 24, 2020Updated 5 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- Neural IIR Filter Field for HRTF Upsampling and Personalization☆27Feb 26, 2024Updated 2 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 2 years ago
- A fast MP3 decoder for python, using minimp3☆30Sep 20, 2022Updated 3 years ago
- Thomas Grill's "bulbul" bird audio detection system, adapted for DCASE 2018☆33Sep 25, 2018Updated 7 years ago
- Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.☆32Jun 23, 2023Updated 2 years ago
- CLOOB Conditioned Latent Diffusion training and inference code☆111Apr 15, 2022Updated 3 years ago
- ☆30Nov 5, 2023Updated 2 years ago
- code associated with WANLI dataset in Liu et al., 2022☆31May 24, 2023Updated 2 years ago
- ☆32Jul 27, 2022Updated 3 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆33Apr 22, 2024Updated last year
- Creation of a multi user audio first annotation tool - GSoC 2021☆29Mar 30, 2023Updated 2 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 6 months ago
- gzip Predicts Data-dependent Scaling Laws☆34May 28, 2024Updated last year
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Oct 15, 2019Updated 6 years ago
- Example of backprop which uses constant memory☆43Aug 22, 2018Updated 7 years ago
- A convolutional generative audio synthesis model☆32Jun 17, 2022Updated 3 years ago
- ☆36Jan 6, 2026Updated last month
- Audio Dataset for training CLAP and other models☆730Jan 8, 2026Updated last month
- MiRA (Music Replication Assessment) tool is a model-independent open evaluation method based on four diverse audio music similarity metri…☆34Nov 14, 2025Updated 3 months ago
- computer-aided rhythm analysis toolbox☆35Jul 9, 2024Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- GaugeMeterView is view which can be used in different Meter applications☆12Feb 25, 2022Updated 4 years ago
- ☆39Oct 3, 2022Updated 3 years ago
- CNN-based singing voice detection experiments☆37Apr 25, 2018Updated 7 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆45May 18, 2023Updated 2 years ago
- List of direct speech-to-speech translation papers.☆38Jan 31, 2023Updated 3 years ago
- Gamma Agreement in Python☆45Mar 4, 2024Updated 2 years ago