End-to-End binaural sound localization
☆17Feb 27, 2020Updated 6 years ago
Alternatives and similar repositories for WaveLoc
Users that are interested in WaveLoc are comparing it to the libraries listed below
Sorting:
- DNN based binaural sound localization model, using GCC-PHAT as features☆22Jun 13, 2023Updated 2 years ago
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated last year
- Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"☆16Mar 2, 2025Updated 11 months ago
- Binaural audio reproduction through loudspeakers. Also known as crosstalk cancellation.☆11Sep 12, 2024Updated last year
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆41Dec 23, 2023Updated 2 years ago
- Binaural impulse responses captured in real rooms.☆37Mar 9, 2016Updated 9 years ago
- Complex-valued neural networks for DOA estimation☆29Jan 25, 2023Updated 3 years ago
- Code for "End-to-End Optimized Speech Coding with Deep Neural Networks" (ICASSP 2018)☆24May 18, 2018Updated 7 years ago
- The IoSR listening room multichannel BRIR dataset contains binaural room impulse responses measured at head angles of 0 to 360 degrees in…☆22Mar 24, 2017Updated 8 years ago
- Files for the paper: "Sound Source Localization using Deep Residual Learning"☆24Nov 13, 2017Updated 8 years ago
- Code for paper Learning Audio-Visual Dereverberation☆30Aug 10, 2022Updated 3 years ago
- [CVPR 2024] 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces☆26Mar 28, 2024Updated last year
- 2.5D visual sound☆118Jul 25, 2023Updated 2 years ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆34Feb 21, 2025Updated last year
- The End-to-End Magnitude Least Squares Binaural Renderer for Spherical Microphone Array Signals☆39Feb 17, 2026Updated last week
- A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission☆32Apr 27, 2022Updated 3 years ago
- Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks☆87Mar 24, 2023Updated 2 years ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆176Jul 24, 2024Updated last year
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago
- Official codebase for "Context Aware Deep Learning for Multi Modal Depression Detection" [ICASSP 2019, Oral]☆11Dec 26, 2024Updated last year
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- ☆10Dec 8, 2025Updated 2 months ago
- This thesis applies an autoencoder deep neural network to the multichannel speech enhancement problem. It takes the problem from dataset …☆12Sep 1, 2022Updated 3 years ago
- ☆10Jun 13, 2022Updated 3 years ago
- Mirror of the Auditory Modelling Toolbox http://amtoolbox.sourceforge.net/☆11Jan 28, 2019Updated 7 years ago
- CHiME-5 Baseline Array Synchronisation☆12Sep 24, 2018Updated 7 years ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- ☆12Jun 1, 2019Updated 6 years ago
- Notes for CS294/194-196: Large Language Model Agents (Fall 2024, UC Berkeley), summarizing 12 lectures on LLM fundamentals, reasoning, pl…☆14Jan 7, 2025Updated last year
- A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil.☆15Feb 17, 2025Updated last year
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch.☆10Oct 27, 2023Updated 2 years ago
- SING: SDE Inference via Natural Gradients☆36Dec 9, 2025Updated 2 months ago
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- [Re] Robust timing and motor patterns by taming chaos in recurrent neural networks, ReScience 2(1), 2016☆12Oct 7, 2016Updated 9 years ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference☆11Jun 10, 2024Updated last year
- Four neural network architectures to classify sound source direction☆11Oct 3, 2020Updated 5 years ago
- Colab notebooks exploring different Machine Learning topics.☆16Apr 2, 2022Updated 3 years ago
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Jan 9, 2025Updated last year
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year