tobiastoft91 / VGGish_AudioClassifer_02456
Acoustic Scene Classification using transfer learning on VGGish pre-trained model
☆11Updated 7 years ago
Alternatives and similar repositories for VGGish_AudioClassifer_02456:
Users that are interested in VGGish_AudioClassifer_02456 are comparing it to the libraries listed below
- Genre Classification Model Based on VGGish☆11Updated last year
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- This paper has been accepted in ACM ICMR 2021.☆20Updated 3 years ago
- ☆58Updated 4 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- Implementations for master thesis "Musical Instrument Recognition in Multi-Instrument Audio Contexts" with MedleyDB.☆15Updated 6 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Updated 9 months ago
- Paderborn Sound Event Detection☆74Updated last year
- CNN-based singing voice detection experiments☆37Updated 7 years ago
- CP-JKU submission to DCASE 19, performant single-model CNN☆56Updated 4 years ago
- Code accompanying the paper: An Attention Mechanism for Musical Instrument Recognition. ISMIR 2019☆24Updated 5 years ago
- MediaEval 2020: Music Mood Classification☆18Updated 4 years ago
- Introducing multi-channel U-Net for Music Source Separation trained using weighted multi-task loss.☆32Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆40Updated 2 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 4 months ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Updated 2 years ago
- Python 3.5 and Windows version of Speech Enhancement using DNN by Yong Xu and Qiuqiang Kong☆15Updated 6 years ago
- Revisiting Singing Voice Detection : a Quantitative Review and the Future Outlook☆67Updated 2 years ago
- Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical In…☆41Updated 2 years ago
- Control mechanisms to the U-Net architecture for doing multiple source separation instruments☆51Updated 4 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago
- Semi-supervised learning using teacher-student models for vocal melody extraction☆42Updated 3 years ago
- ☆31Updated 9 months ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆41Updated 5 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Updated last year
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆42Updated 2 years ago