marl / autopool
Adaptive pooling operators for multiple instance learning
☆77Updated 3 years ago
Alternatives and similar repositories for autopool
Users who are interested in autopool compare it to the libraries listed below
- PyTorch implementation of "Sample-level Deep Convolutional Neural Networks for Music Auto-tagging Using Raw Waveforms"☆59Updated 2 years ago
- Official implementation of the Seq-U-Net for efficient sequence modelling☆79Updated last year
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Updated 4 years ago
- Thomas Grill's "bulbul" bird audio detection system, adapted for DCASE 2018☆32Updated 6 years ago
- Implementation of deep recurrent nonnegative matrix factorization (DR-NMF) for speech separation☆49Updated 6 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆37Updated 7 years ago
- An audio classification system for learning with out-of-distribution data☆33Updated 2 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Updated 2 years ago
- PyTorch implementation of time-domain filterbanks☆112Updated 3 years ago
- Utils and data sets for audio and PyTorch☆86Updated 3 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆98Updated 6 years ago
- Multiple Instance Learning for Sound Event Detection☆34Updated 7 years ago
- Learn an L3 embedding from audio/video pairs☆88Updated 3 years ago
- Audio captioning baseline system for DCASE 2020 challenge.☆38Updated 2 years ago
- Audio processing module for PyTorch: STFT, iSTFT☆36Updated 6 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆74Updated 4 years ago
- Zero-shot Learning for Audio-based Music Classification and Tagging (ISMIR 2019)☆42Updated 5 years ago
- DenseNets for the detection of singing birds in audio files☆17Updated 7 years ago
- Baseline systems for the FSD50K dataset☆69Updated 3 years ago
- PyTorch and TensorFlow data loaders for several audio datasets☆113Updated 5 years ago
- Implementation of the BASIS algorithm for source separation with deep generative priors☆39Updated 2 years ago
- Chainer implementation of between-class learning for sound recognition https://arxiv.org/abs/1711.10282☆95Updated 7 years ago
- Code accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"☆84Updated 6 years ago
- Control mechanisms for the U-Net architecture to perform multi-instrument source separation☆54Updated 5 years ago
- Code for https://arxiv.org/abs/1712.00254☆16Updated 7 years ago
- An implementation of the Prism layer (https://arxiv.org/abs/2011.04823)☆12Updated 4 years ago
- Two implementations of the Constant Q Transform☆27Updated 8 years ago