KosminD / YAMNet_transfer
☆17Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for YAMNet_transfer
- Detect specific type of sound in audio signals☆11Updated 4 months ago
- Combine sound source separation with SRP-PHAT to achieve multi-source localization.☆51Updated 8 months ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.☆66Updated 3 years ago
- Dual-signal Transformation LSTM Network, PyTorch,NCNN☆67Updated 7 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆58Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆53Updated 3 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆41Updated last year
- Reading list for research topics in Sound AI☆165Updated 3 months ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆230Updated 6 months ago
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆89Updated 2 years ago
- ☆9Updated 4 years ago
- Phase-aware speech enchancement with Deep Complex U-Net☆96Updated last year
- Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement☆194Updated 2 years ago
- Repo associated to the DESED dataset, download and creation of data☆125Updated 3 months ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆312Updated 2 weeks ago
- ☆46Updated last year
- Collection of EM algorithms for blind source separation of audio signals☆269Updated 3 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆76Updated 3 months ago
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 4 years ago
- simple delaysum, MVDR and CGMM-MVDR☆238Updated 5 years ago
- Easy to use Beamformers for multi-channel speech separation/enhancement☆186Updated 3 years ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.☆105Updated last year
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆125Updated 2 weeks ago
- ☆53Updated 6 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆105Updated 2 months ago
- ☆86Updated 2 years ago
- TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain☆46Updated 2 years ago
- ☆18Updated 3 years ago
- The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.☆185Updated 2 years ago