yinkalario / General-Purpose-Sound-Recognition-Demo
General purpose sound recognition demo
☆149 Updated last year
Related projects
Alternatives and complementary repositories for General-Purpose-Sound-Recognition-Demo
- ☆101 Updated 4 years ago
- ☆198 Updated 8 months ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp… ☆125 Updated 4 years ago
- ☆76 Updated last year
- Domestic environment sound event detection task ☆129 Updated 5 months ago
- ☆41 Updated 2 months ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection" ☆366 Updated 3 months ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation ☆203 Updated last year
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ☆98 Updated last year
- A two-stage polyphonic sound event detection and localization method for both SED and DOA. ☆106 Updated last year
- Repo associated to the DESED dataset, download and creation of data ☆127 Updated 4 months ago
- ☆79 Updated last year
- Pytorch code for "Rethinking CNN Models for Audio Classification" ☆122 Updated 3 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation". ☆140 Updated last year
- ☆53 Updated 6 years ago
- Reading list for research topics in Sound AI ☆166 Updated 3 months ago
- ☆53 Updated 4 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection". ☆100 Updated last month
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆35 Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge ☆53 Updated 4 years ago
- Baseline of DCASE 2020 task 4 ☆42 Updated 2 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning ☆48 Updated 5 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events ☆129 Updated 5 months ago
- Audio transformations library for PyTorch ☆226 Updated 2 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training … ☆233 Updated 6 months ago
- Code for DCASE 2020 task 1a and task 1b. ☆85 Updated 2 years ago
- ☆223 Updated 4 years ago
- A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/… ☆209 Updated last year
- An open source dataset for source separation ☆380 Updated 9 months ago
- ☆62 Updated 2 months ago