robertanto / Real-Time-Sound-Event-DetectionLinks
This repository contains the python implementation of a Sound Event Detection systems working in real time.
☆65Updated 2 years ago
Alternatives and similar repositories for Real-Time-Sound-Event-Detection
Users that are interested in Real-Time-Sound-Event-Detection are comparing it to the libraries listed below
Sorting:
- ☆94Updated 2 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆137Updated 2 weeks ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆93Updated 2 years ago
- Reading list for research topics in Sound AI☆185Updated last year
- NSNet2 Deep Noise Suppression (DNS) package☆37Updated 2 years ago
- Visualization toolbox for Sound Event Detection☆122Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation☆126Updated 11 months ago
- Paderborn Sound Event Detection☆75Updated 2 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆101Updated 2 years ago
- Machine and Deep Learning models for speech dereverberation☆116Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- Repo associated to the DESED dataset, download and creation of data☆136Updated last year
- ☆58Updated last year
- ☆47Updated 11 months ago
- This code is to run the WARP-Q speech quality metric.☆35Updated 9 months ago
- Python framework for Speech and Music Detection using Keras.☆108Updated 2 years ago
- ☆55Updated 2 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆44Updated 3 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆202Updated 3 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆91Updated 4 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆68Updated 3 years ago
- ☆13Updated last month
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆92Updated last year
- Toolkit for downloading and processing Google's AudioSet dataset.☆170Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Expressive Anechoic Recordings of Speech (EARS)☆182Updated last year
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)☆93Updated 6 years ago
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆102Updated 2 years ago
- Speech Dereverberation using Fully Convolutional Networks☆72Updated 4 years ago