multitel-ai / urban-sound-tagging

1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context

☆16

Alternatives and similar repositories for urban-sound-tagging

Users who are interested in urban-sound-tagging are comparing it to the repositories listed below.

sainathadapa / dcase2019-task5-urban-sound-tagging
1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging
☆30 Updated 4 years ago
marc-moreaux / audioset_raw
Download and create a tfreader for the audioset dataset
☆16 Updated 5 years ago
qiuqiangkong / sed_time_freq_segmentation
☆45 Updated 6 years ago
soham97 / MTL_Weakly_labelled_audio_data
Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"
☆17 Updated 2 years ago
dr-costas / dnd-sed
Sound event detection with depthwise separable and dilated convolutions.
☆53 Updated 5 years ago
popcornell / OSDC
☆16 Updated 4 years ago
nttcslab / composing-general-audio-repr
Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model
☆26 Updated 2 years ago
ssrp / SubSpectralNet-PyTorch
PyTorch Implementation of SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, a…
☆22 Updated 6 years ago
edufonseca / FSD50K_baseline
Baseline systems for the FSD50K dataset
☆69 Updated 3 years ago
bill317996 / Singer-identification-in-artist20
Addressing the confounds of accompaniments in singer identification
☆18 Updated 5 years ago
vivsivaraman / sourcesepganprior
☆18 Updated 4 years ago
qiuqiangkong / sound_event_detection_dcase2017_task4
☆54 Updated 5 years ago
audio-captioning / dcase-2020-baseline
Audio captioning baseline system for DCASE 2020 challenge.
☆38 Updated last year
RicherMans / CDur
Repository for the paper "Towards duration robust weakly supervised sound event detection"
☆23 Updated 2 years ago
hearbenchmark / hear-baseline
Simple baseline model for the HEAR benchmark
☆23 Updated last month
edufonseca / uclser20
Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.
☆92 Updated 2 years ago
raymondxyy / strfnet-IS2020
Official repo for the STRFNet system presented at INTERSPEECH 2020
☆12 Updated 4 years ago
haoheliu / DCASE_2022_Task_5
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28 Updated 3 years ago
yinkalario / Sound-Event-Detection-AudioSet
☆47 Updated 11 months ago
dr-costas / SEDLM
Language modelling for sound event detection
☆20 Updated 5 years ago
cvqluu / MTL-Speaker-Embeddings
Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…
☆25 Updated 2 years ago
corticph / MSTmodel
Code for https://arxiv.org/abs/1712.00254
☆16 Updated 7 years ago
BiometricVox / DAE_SpeakerID
Denoising autoencoders for speaker identification on MCE 2018 challenge
☆12 Updated 6 years ago
ws-choi / LASAFT-Net-v2
A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"
☆33 Updated 3 years ago
RanyaJumah / Emotionless_Privacy_Preserving_Speech_Analysis
☆10 Updated 6 years ago
tqbl / ood_audio
An audio classification system for learning with out-of-distribution data
☆33 Updated 2 years ago
WangHelin1997 / GL-AT
PyTorch implementation of the paper "A Global-local Attention Framework for Weakly Labelled Audio Tagging".
☆13 Updated 4 years ago
qiuqiangkong / ICASSP2018_audioset
☆27 Updated 7 years ago
CHeggan / MetaAudio-A-Few-Shot-Audio-Classification-Benchmark
A new comprehensive and diverse few-shot acoustic classification benchmark.
☆64 Updated 10 months ago
shangeth / wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…
☆92 Updated 4 years ago