axeber01/wav2pos

3D Sound Source Localization using Masked Autoencoders

☆21

Alternatives and similar repositories for wav2pos

Users who are interested in wav2pos are comparing it to the repositories listed below.

egrinstein / gnn_ssl
Graph Neural Networks for Sound Source Localization
☆29 · Oct 31, 2023 · Updated 2 years ago
axeber01 / ngcc-seld
Sound Event Localization and Detection using Neural Generalized Cross-Correlations
☆36 · Feb 11, 2025 · Updated last year
egrinstein / neural_srp
The Neural-SRP method for DOA estimation
☆37 · May 24, 2024 · Updated 2 years ago
vlsi-nanocomputing / dynamic-sound
DynamicSound Simulator is a modular Python library for generating virtual acoustic scenes with configurable microphones, sound sources, a…
☆18 · Jul 15, 2026 · Updated last week
axeber01 / ngcc
Neural Generalized Cross Correlations https://arxiv.org/abs/2208.04654
☆37 · Feb 11, 2025 · Updated last year
FYJNEVERFOLLOWS / Awesome-Sound-Source-Localization
A tutorial for Sound Source Localization researchers and practitioners. The purpose of this repo is to organize the world’s resources for…
☆59 · Mar 17, 2023 · Updated 3 years ago
BingYang-20 / SRP-DNN
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
☆67 · Sep 28, 2024 · Updated last year
FYJNEVERFOLLOWS / ResNet-STFT-SSL
ResNet-STFT Model for Sound Source Localization
☆20 · Aug 25, 2022 · Updated 3 years ago
Devin-Pi / uncertainty-estimation-for-ssl
This repo is for the paper "Uncertainty Estimation for Sound Source Localization".
☆15 · Mar 13, 2025 · Updated last year
Fhrozen / jrm_ssl
Files for the paper: "Sound Source Localization using Deep Residual Learning"
☆24 · Nov 13, 2017 · Updated 8 years ago
KawhiZhao / Egocentric-Audio-Visual-Speaker-Localization
Code for paper Audio Visual Speaker Localization from EgoCentric Views
☆11 · Jul 3, 2024 · Updated 2 years ago
Jinbo-Hu / PSELDNets
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
☆47 · Sep 17, 2025 · Updated 10 months ago
Audio-WestlakeU / FN-SSL
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159 · Mar 10, 2026 · Updated 4 months ago
yusunnny / CST-former
CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)
☆39 · May 20, 2025 · Updated last year
idiap / nnsslm
Neural Network based Sound Source Localization Models
☆51 · Aug 29, 2023 · Updated 2 years ago
ghunkins / Binaural-Source-Localization-CNN
A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…
☆10 · Dec 16, 2017 · Updated 8 years ago
jingkangqi / DSENet
☆34 · Feb 19, 2025 · Updated last year
arief25ramadhan / sound-source-localization
Four neural network architectures to classify sound source direction
☆11 · Oct 3, 2020 · Updated 5 years ago
heeeyk / Transformer-DOA-Prediction
A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil.
☆16 · Feb 17, 2025 · Updated last year
SOUNDS-RESEARCH / complex_neural_source_localization
Complex-valued neural networks for DOA estimation
☆31 · Jan 25, 2023 · Updated 3 years ago
DavidDiazGuerra / icoDOA
Code repository for the paper Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs
☆49 · May 19, 2022 · Updated 4 years ago
janhq / WhisperSpeech
Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…
☆16 · Jan 20, 2025 · Updated last year
swimmiing / ACL-SSL
Repository of the IJCV'26 & WACV'24 paper
☆34 · Apr 27, 2026 · Updated 2 months ago
zuhairmhtb / AudioClassification
This software is a demonstration of Audio Signal Processing and Machine Learning using Python and Tensorflow. The software contains a GU…
☆12 · Dec 7, 2023 · Updated 2 years ago
ShiDongyuan / Selective_ANC_CNN
☆13 · Jan 28, 2022 · Updated 4 years ago
Yifei-ZHAO96 / STAM-pytorch
Pytorch implementation of "spectro-temporal attention-based voice activity detection"
☆13 · Jun 4, 2024 · Updated 2 years ago
CARNIVAL-IITP / Sound_source_localization
☆36 · Feb 14, 2025 · Updated last year
bingo-todd / WaveLoc
End-to-End binaural sound localization
☆17 · Feb 27, 2020 · Updated 6 years ago
zzb-nice / DOA_est_Master
☆32 · Apr 21, 2025 · Updated last year
zszheng147 / Spatial-AST
🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
☆87 · Feb 13, 2025 · Updated last year
vineeths96 / TDOA-Localization
In this repository, we deal with developing different estimators to localize Transvahan - the e-vehicle on IISc Campus using measurements…
☆20 · Jul 2, 2020 · Updated 6 years ago
Yifei-ZHAO96 / Tr-VAD
Tr-VAD: An Efficient Transformer based Voice Activity Detection Model
☆18 · Aug 1, 2024 · Updated last year
ShlezingerLab / SubspaceNet
☆82 · Dec 2, 2024 · Updated last year
IoTKETI / oneM2MBrowser
KETI Mobius platform resource management tool
☆11 · Jun 22, 2022 · Updated 4 years ago
violet-liang / soundfield-reconstruction-np
Sound field reconstruction using neural processes with dynamic kernels
☆16 · Mar 25, 2025 · Updated last year
spatUV / SART3Dmaster
Master repository for 3D Spatial Audio Reproduction Toolbox
☆22 · Jul 25, 2016 · Updated 10 years ago
upskyy / Paper-Review
Paper Review about Speech Recognition · NLP
☆10 · Mar 25, 2021 · Updated 5 years ago
hasnainnaeem / Gunshot-Detection-in-Audio
Audio classification deep learning model using TensorFlow 2.0 to detect Gunshots. 97.5% test set accuracy and 99% training set accuracy w…
☆23 · Feb 16, 2020 · Updated 6 years ago
wxqwinner / silero-vad-ncnn
Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.
☆26 · Aug 21, 2024 · Updated last year