yusunnny/CST-former

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yusunnny/CST-former)

yusunnny / CST-former

CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)

☆39

Alternatives and similar repositories for CST-former

Users that are interested in CST-former are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Jinbo-Hu / SELD-Data-Generator
View on GitHub
Data generator for sound event localization and detection clips, including 4-ch microphone-array-format signals and first-order-ambisonic…
☆22Nov 13, 2024Updated last year
muuda / MFF-EINV2
View on GitHub
MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection
☆22Jul 17, 2024Updated 2 years ago
sadPororo / AD-YOLO
View on GitHub
AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, IEEE ICASSP 2023
☆35Dec 21, 2025Updated 7 months ago
partha2409 / DCASE2024_seld_baseline
View on GitHub
☆52Dec 13, 2025Updated 7 months ago
rfalcon100 / seld_dcase2022_ric
View on GitHub
My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.
☆12Nov 12, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Orlllem / seld_wav2vec2
View on GitHub
☆18Feb 1, 2026Updated 5 months ago
dberghi / AV-SELD
View on GitHub
Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
☆31Apr 26, 2024Updated 2 years ago
Jinbo-Hu / DCASE2022-TASK3
View on GitHub
☆37Nov 14, 2024Updated last year
Jinbo-Hu / PSELDNets
View on GitHub
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
☆47Sep 17, 2025Updated 10 months ago
axeber01 / ngcc-seld
View on GitHub
Sound Event Localization and Detection using Neural Generalized Cross-Correlations
☆36Feb 11, 2025Updated last year
jiwonix / Sound-Event-Detection-papers
View on GitHub
Sound Event Detection (SED) paper collection
☆15Jun 26, 2024Updated 2 years ago
danielkrause / DCASE2022-data-generator
View on GitHub
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47Apr 5, 2023Updated 3 years ago
partha2409 / DCASE2025_seld_baseline
View on GitHub
☆27May 27, 2025Updated last year
sakshamsingh1 / sound_distance_estimation
View on GitHub
Official implementation of "sound distance estimation" WASPAA 23
☆20Dec 31, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
danielkrause / Moving-Binaural-SDEL
View on GitHub
Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"
☆22Mar 2, 2025Updated last year
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
thomeou / SALSA
View on GitHub
This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.
☆114May 31, 2022Updated 4 years ago
sarulab-speech / SpatialCLAP
View on GitHub
☆19Oct 9, 2025Updated 9 months ago
kinggongzilla / DCASE2023_Task2
View on GitHub
☆23May 15, 2023Updated 3 years ago
yxdong0320 / Solution_on_3D_SELD
View on GitHub
The program ranked first in Audio-only track of DCASE2024 Challenge task3.
☆22Mar 2, 2026Updated 4 months ago
axeber01 / wav2pos
View on GitHub
3D Sound Source Localization using Masked Autoencoders
☆21Feb 12, 2025Updated last year
marl / SpatialScaper
View on GitHub
☆75Aug 7, 2025Updated 11 months ago
donghoney0416 / DeepASA
View on GitHub
Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"
☆26Apr 15, 2026Updated 3 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Audio-WestlakeU / FN-SSL
View on GitHub
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159Mar 10, 2026Updated 4 months ago
sony / audio-visual-seld-dcase2023
View on GitHub
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68Mar 19, 2025Updated last year
antonioalmudevar / dcase2022_task2
View on GitHub
☆11Jul 6, 2022Updated 4 years ago
sharathadavanne / seld-dcase2023
View on GitHub
Baseline method for sound event localization task of DCASE 2023 challenge
☆71Mar 13, 2023Updated 3 years ago
soonhyeon / Noisy-ArcMix
View on GitHub
Noisy-ArcMix: Additive Noisy Angular Margin Loss Combined With Mixup for Anomalous Sound Detection
☆31Aug 22, 2024Updated last year
swagshaw / WildDESED
View on GitHub
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18Nov 19, 2024Updated last year
cai525 / Transformer4SED
View on GitHub
This repository aims to collect Transformer-based sound event detection (SED) algorithms.
☆104Feb 10, 2026Updated 5 months ago
yinkalario / EIN-SELD
View on GitHub
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
☆79Aug 5, 2021Updated 4 years ago
XiangzhuKong / CA-Dense-UNet
View on GitHub
An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement
☆13Jul 17, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
thomeou / General-network-architecture-for-sound-event-localization-and-detection
View on GitHub
This repository consists of python code to train sound event localization and detection models.
☆22Jan 21, 2021Updated 5 years ago
Devin-Pi / uncertainty-estimation-for-ssl
View on GitHub
This repo is for the paper "Uncertainty Estimation for Sound Source Localization".
☆15Mar 13, 2025Updated last year
egrinstein / gnn_ssl
View on GitHub
Graph Neural Networks for Sound Source Localization
☆30Oct 31, 2023Updated 2 years ago
sharathadavanne / seld-dcase2022
View on GitHub
Baseline method for sound event localization task of DCASE 2022 challenge
☆64Jun 21, 2022Updated 4 years ago
fschmid56 / PretrainedSED
View on GitHub
☆145May 13, 2025Updated last year
SonyResearch / dcase2025_stereo_seld_data_generator
View on GitHub
Data generator for stereo sound event localization and detection task of DCASE 2025 challenge
☆17Jul 17, 2025Updated last year
Harper812 / FFDConv
View on GitHub
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
☆27May 13, 2026Updated 2 months ago