KosminD / YAMNet_transfer
☆20Updated 4 years ago
Alternatives and similar repositories for YAMNet_transfer
Users that are interested in YAMNet_transfer are comparing it to the libraries listed below
- Detect specific type of sound in audio signals☆12Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆142Updated 2 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆93Updated 2 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆289Updated 7 months ago
- Phase-aware speech enhancement with Deep Complex U-Net☆114Updated 2 years ago
- PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio☆190Updated last year
- Real-time speech enhancement mobile app using Nested U-Net☆51Updated last year
- Repo associated to the DESED dataset, download and creation of data☆138Updated 11 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆64Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆78Updated 10 months ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆91Updated 4 years ago
- Convmelspec: Convertible Melspectrograms via 1D Convolutions☆143Updated last year
- Sound event detection with depthwise separable and dilated convolutions.☆53Updated 5 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Updated last year
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆103Updated 10 months ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆100Updated 3 years ago
- PyTorch implementation of the LEAF audio frontend☆73Updated 2 years ago
- This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.☆101Updated 3 years ago
- Source code for Consistent ensemble distillation for audio tagging☆34Updated 2 weeks ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆55Updated 4 years ago
- TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain☆51Updated 3 years ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆70Updated 2 years ago
- Paderborn Sound Event Detection☆74Updated last year
- Speech Separation☆64Updated last year
- ☆93Updated 2 years ago
- Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement☆212Updated 2 years ago
- Easy to use Beamformers for multi-channel speech separation/enhancement☆200Updated 4 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆81Updated 4 years ago
- Real-time binaural target sound extraction model.☆86Updated last year
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆189Updated last year