SonyResearch/dcase2025_stereo_seld_data_generator

Data generator for the stereo sound event localization and detection task of the DCASE 2025 Challenge

☆17

Alternatives and similar repositories for dcase2025_stereo_seld_data_generator

Users who are interested in dcase2025_stereo_seld_data_generator often compare it to the repositories listed below.

partha2409 / DCASE2025_seld_baseline
☆27 · May 27, 2025 · Updated last year
aromanusc / SoundQ
Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE Task 3 format)
☆14 · Mar 21, 2025 · Updated last year
dberghi / AV-SELD
Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
☆31 · Apr 26, 2024 · Updated 2 years ago
yxdong0320 / Solution_on_3D_SELD
The program ranked first in the audio-only track of DCASE2024 Challenge Task 3.
☆22 · Mar 2, 2026 · Updated 4 months ago
marl / SpatialScaper
☆75 · Aug 7, 2025 · Updated 11 months ago
Jinbo-Hu / PSELDNets
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
☆47 · Sep 17, 2025 · Updated 10 months ago
Hong-Hengyi / MVANet-SELD
For more detailed information, please refer to the paper titled "MVANet: Multi-Stage Video Attention Network for Sound Event Localization…
☆35 · May 20, 2025 · Updated last year
partha2409 / DCASE2024_seld_baseline
☆52 · Dec 13, 2025 · Updated 7 months ago
Jinbo-Hu / SELD-Data-Generator
Data generator for sound event localization and detection clips, including 4-ch microphone-array-format signals and first-order-ambisonic…
☆22 · Nov 13, 2024 · Updated last year
danielkrause / DCASE2022-data-generator
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47 · Apr 5, 2023 · Updated 3 years ago
danielkrause / Moving-Binaural-SDEL
Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"
☆22 · Mar 2, 2025 · Updated last year
juliawilkins / ambisonics2binaural_simple
A simple Python script to convert FOA audio to binaural.
☆17 · Nov 29, 2022 · Updated 3 years ago
AudibleLight / AudibleLight
A controllable, end-to-end API for soundscape synthesis across ray-traced & real-world measured acoustics
☆27 · Apr 1, 2026 · Updated 3 months ago
adrianSRoman / DeepWaveDOA
ICASSP 2024: Robust DOA estimation from deep acoustic imaging
☆25 · Apr 14, 2024 · Updated 2 years ago
sakshamsingh1 / sound_distance_estimation
Official implementation of "sound distance estimation" WASPAA 23
☆20 · Dec 31, 2023 · Updated 2 years ago
PeiwenSun2000 / Both-Ears-Wide-Open
The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
☆65 · Jul 2, 2025 · Updated last year
BingYang-20 / SRP-DNN
A Python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
☆67 · Sep 28, 2024 · Updated last year
yusunnny / CST-former
CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)
☆39 · May 20, 2025 · Updated last year
omarAlezaby / Mimicked_Ali
Repository for "Enhanced Super-Resolution Training via Mimicked Alignment for Real-World Scenes", ACCV 2024
☆16 · Dec 2, 2024 · Updated last year
zeroone-universe / TowardsRobustSpeechSR
Unofficial PyTorch Lightning implementation of "Towards Robust Speech Super-Resolution"
☆10 · May 8, 2023 · Updated 3 years ago
kleinfreund / balatrolator
Balatro calculator
☆17 · Jul 10, 2026 · Updated 2 weeks ago
fschmid56 / cpjku_dcase23
This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"
☆32 · Sep 18, 2023 · Updated 2 years ago
HUWEI0721 / 2023-Tongji-SSE-Experimemts-in-computer-organization
Lab reports for the Computer Organization lab course, first semester of 2023-2024, School of Software Engineering, Tongji University
☆11 · Mar 2, 2024 · Updated 2 years ago
vTAD2025-Challenge / vTAD
☆17 · Oct 24, 2025 · Updated 9 months ago
fengpeng-yue / ASRTTS
ASR & TTS joint training, asr, tts, machine speech chain
☆16 · Oct 16, 2021 · Updated 4 years ago
upskyy / Paper-Review
Paper Review about Speech Recognition · NLP
☆10 · Mar 25, 2021 · Updated 5 years ago
BASHLab / OWL
☆15 · May 25, 2026 · Updated 2 months ago
Audio-WestlakeU / OnlineSSL_DPRTF_EG
☆12 · Apr 1, 2020 · Updated 6 years ago
wwwwwyyyyyxxxxx / SA2GVAN
☆13 · Sep 4, 2023 · Updated 2 years ago
Audio-WestlakeU / FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159 · Mar 10, 2026 · Updated 4 months ago
egrinstein / xsrp
Modular implementation of the Steered Response Power method and its variants
☆44 · Mar 25, 2026 · Updated 4 months ago
sxpro / RFSR
Code for RFSR: Improving ISR Diffusion Models via Reward Feedback Learning
☆18 · Dec 8, 2024 · Updated last year
cjddny / cocos2d_guardCarrot
cocos2d-x Carrot Fantasy (保卫萝卜), C++
☆11 · Mar 1, 2016 · Updated 10 years ago
whichwhichgone / VLAS
☆48 · Jul 8, 2025 · Updated last year
Robiwan245 / SiamMAE
☆12 · Mar 5, 2024 · Updated 2 years ago
JinXins / SUMix
Official PyTorch (MMCV) implementation of “SUMix: Mixup with Semantic and Uncertain Information” (ECCV 2024)
☆12 · Sep 2, 2024 · Updated last year
victkk / 3DGS_SLAM_mobile_app
☆18 · Oct 18, 2025 · Updated 9 months ago
swagshaw / WildDESED
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18 · Nov 19, 2024 · Updated last year
3dlg-hcvc / multion-challenge
Starter code and instructions for participating in MultiON Challenge 2021.
☆12 · Jun 12, 2024 · Updated 2 years ago