zhaoyanpeng / audioset-dlLinks
Download AudioSet for Vision-Audio-Text Pre-training
☆13Updated 3 years ago
Alternatives and similar repositories for audioset-dl
Users that are interested in audioset-dl are comparing it to the libraries listed below
Sorting:
- VIsually-Pivoted Audio and(N) Text☆22Updated 3 years ago
- Dataset and baseline for the first Audiocaption task☆79Updated last year
- CNN-based singing voice detection experiments☆37Updated 7 years ago
- ☆48Updated 2 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 4 years ago
- ☆32Updated 4 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated 2 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago
- ☆16Updated 5 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorch☆87Updated last year
- Learn and L3 embedding from audio/video pairs☆88Updated 3 years ago
- Baseline systems for the FSD50K dataset☆69Updated 3 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 5 years ago
- Util code, issues, discussions☆29Updated 7 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆42Updated 5 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Simple baseline model for the HEAR benchmark☆23Updated this week
- lazy_dataset: Process large datasets as if it was an iterable.☆18Updated 8 months ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 5 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- Crowdsourced Audio Quality Evaluation Toolkit☆55Updated 2 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Utils and data sets for audio and PyTorch☆86Updated 3 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- Training and evaluation code for Re-MOVE models with embedding distillation☆31Updated 2 years ago
- Self-Supervised Contrastive Learning of Music Spectrograms☆31Updated 4 years ago
- easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox☆52Updated 5 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Updated 4 years ago