KosminD / YAMNet_transfer
☆18Updated 4 years ago
Alternatives and similar repositories for YAMNet_transfer:
Users that are interested in YAMNet_transfer are comparing it to the libraries listed below
- Detect specific type of sound in audio signals☆12Updated 9 months ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆41Updated last year
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆97Updated 2 years ago
- Voice Activity Detection (VAD) using deep learning.☆195Updated 5 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.☆41Updated 3 years ago
- Reading list for research topics in Sound AI☆179Updated 7 months ago
- Repo associated to the DESED dataset, download and creation of data☆137Updated 8 months ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆272Updated 4 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆136Updated 3 months ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 5 years ago
- Dual-signal Transformation LSTM Network, PyTorch,NCNN☆73Updated 11 months ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.☆72Updated 3 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆54Updated 4 years ago
- Lightweight CNN for Robust Voice Activity Detection☆19Updated last year
- Easy to use Beamformers for multi-channel speech separation/enhancement☆195Updated 4 years ago
- ☆57Updated last year
- ☆216Updated last year
- ☆92Updated 2 years ago
- simple delaysum, MVDR and CGMM-MVDR☆257Updated 6 years ago
- Visualization toolbox for Sound Event Detection☆119Updated last year
- Real-time speech enhancement mobile app using Nested U-Net☆48Updated last year
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆143Updated last year
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆184Updated last year
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- General purpose sound recognition demo☆156Updated last year
- ☆9Updated 4 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆128Updated 3 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow☆63Updated 4 years ago
- ☆105Updated 4 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆133Updated last year