WangHelin1997/GL-AT

PyTorch implementation of the paper: A Global-local Attention Framework for Weakly Labelled Audio Tagging.

☆13

Alternatives and similar repositories for GL-AT

Users who are interested in GL-AT are comparing it to the repositories listed below.

qiuqiangkong / sampleRNN_acoustic_scene_generation
☆14 · Apr 18, 2019 · Updated 7 years ago
yangdongchao / Target-sound-event-detection
The source code for target sound detection
☆15 · Feb 26, 2022 · Updated 4 years ago
RicherMans / CDur
Repository for the paper "Towards duration robust weakly supervised sound event detection"
☆23 · Aug 3, 2023 · Updated 2 years ago
lukewys / dcase_2020_T6
2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…
☆24 · Aug 3, 2023 · Updated 2 years ago
qiuqiangkong / sound_event_detection_dcase2017_task4
☆55 · Jun 3, 2020 · Updated 6 years ago
emilywg / DCASE2020-Task1
Jupyter notebook for DCASE 2020 challenge Task 1
☆20 · Jun 24, 2020 · Updated 6 years ago
swagshaw / WildDESED
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18 · Nov 19, 2024 · Updated last year
WangHelin1997 / AT-GCN
PyTorch implementation of the paper: Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
☆14 · Sep 18, 2020 · Updated 5 years ago
WangHelin1997 / LibriLightMix-WHAMR
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17 · Nov 7, 2024 · Updated last year
McDonnell-Research-Lab / DCASE2019-Task1
Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao…
☆22 · Jul 3, 2024 · Updated 2 years ago
Yip-Jia-Qi / codecformer
☆21 · Jul 15, 2024 · Updated 2 years ago
yangdongchao / DCASE2021Task5
The code for DCASE2021 task5 submission.
☆20 · Feb 21, 2022 · Updated 4 years ago
yangdongchao / Tim-TSENet
The source code of Tim-TSENet
☆15 · Apr 22, 2022 · Updated 4 years ago
WangHelin1997 / Aty-TTS
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11 · May 14, 2025 · Updated last year
qiuqiangkong / dcase2019_task4
☆21 · Apr 11, 2019 · Updated 7 years ago
shayangharib / AUDASC
Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification
☆36 · Aug 23, 2018 · Updated 7 years ago
qiuqiangkong / audioset_source_separation
☆17 · Feb 14, 2020 · Updated 6 years ago
marc-moreaux / audioset_raw
Download and create a tfreader for the audioset dataset
☆17 · Apr 16, 2020 · Updated 6 years ago
JishengBai / ICME2024ASC
baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift
☆18 · Mar 16, 2024 · Updated 2 years ago
michaelneri / unsupervised-audio-anomaly-detection
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11 · Nov 6, 2024 · Updated last year
toni-heittola / dcase2020_task1_baseline
DCASE2020 Challenge Task 1 baseline system
☆25 · Jun 22, 2020 · Updated 6 years ago
denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…
☆16 · Aug 9, 2021 · Updated 4 years ago
WangHelin1997 / nnAudio2
Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available…
☆21 · Nov 30, 2020 · Updated 5 years ago
edufonseca / uclser20
Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.
☆93 · Dec 22, 2022 · Updated 3 years ago
audio-captioning / audio-captioning-papers
A list of papers about audio captioning
☆78 · Jul 1, 2022 · Updated 4 years ago
y-kawagu / dcase2020_task2_baseline
DCASE2020 Challenge Task 2 baseline system
☆120 · Dec 27, 2022 · Updated 3 years ago
ssrp / SubSpectralNet
SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019
☆18 · Feb 20, 2019 · Updated 7 years ago
zhang201882 / MTF-CRNN
Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…
☆23 · Apr 15, 2020 · Updated 6 years ago
OzerCanDevecioglu / Exploring-Sound-vs-Vibration-for-Robust-Fault-Detection-on-Rotating-Machinery
☆13 · Jul 4, 2024 · Updated 2 years ago
C-Fun / Self-Attentive-Pooling-for-Efficient-Deep-Learning
Official PyTorch implementation of the paper entitled 'Self Attentive Pooling for Efficient Deep Learning'.
☆13 · May 3, 2024 · Updated 2 years ago
DTaoo / Discriminative-Sounding-Objects-Localization
Code for Discriminative Sounding Objects Localization (NeurIPS 2020)
☆61 · Jan 19, 2022 · Updated 4 years ago
RicherMans / AudioCaption
Dataset and baseline for the first Audiocaption task
☆79 · Jul 25, 2024 · Updated last year
HumBug-Mosquito / HumBugDB
Acoustic mosquito detection code with Bayesian Neural Networks
☆62 · Oct 4, 2021 · Updated 4 years ago
huaidanquede / Dense-TSNet
Official code for Dense-TSNet
☆12 · Sep 17, 2024 · Updated last year
aispeech-lab / WASE
PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…
☆27 · Jan 11, 2022 · Updated 4 years ago
JusperLee / Swift-Net
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26 · Updated this week
WangHelin1997 / Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…
☆33 · Jun 14, 2024 · Updated 2 years ago
chenchy / D3Net
A PyTorch implementation of D3Net.
☆11 · Aug 8, 2021 · Updated 4 years ago
marmoi / dcase2023_task4b_baseline
Baseline code for DCASE 2023 task 4 B
☆15 · Apr 21, 2023 · Updated 3 years ago