Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds
☆20Dec 18, 2021Updated 4 years ago
Alternatives and similar repositories for binaural-sound-perception
Users that are interested in binaural-sound-perception are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework for building speech-enabled websites.☆10Jul 10, 2015Updated 10 years ago
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆46Sep 10, 2025Updated 9 months ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆36Feb 15, 2024Updated 2 years ago
- Download scripts and tools for Replay dataset.☆39Jun 23, 2023Updated 3 years ago
- ☆38Jun 29, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Spectral Tensor Train Parameterization of Deep Learning Layers☆17Jul 1, 2021Updated 4 years ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆26Apr 26, 2026Updated 2 months ago
- ☆12Dec 7, 2024Updated last year
- Generator for anechoic, non-stationary noise signals☆11Aug 12, 2022Updated 3 years ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆32Feb 13, 2026Updated 4 months ago
- ☆17Jul 23, 2023Updated 2 years ago
- Cython iterative farthest point sampling implementation☆12Mar 10, 2020Updated 6 years ago
- Code for Adaptation Network introduced in "Block-wise Scrambled Image Recognition Using Adaptation Network" paper (AAAI WS 2020)☆12Dec 3, 2019Updated 6 years ago
- Model for selecting perceptually relevant early reflections for parametric spatial sound rendering☆13Oct 26, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis☆14Oct 3, 2024Updated last year
- ☆28Nov 27, 2018Updated 7 years ago
- The repository provides code for EgoMAN model and dataset creation scripts.☆31Dec 31, 2025Updated 5 months ago
- Extract your SlidesLive presentation.☆15Apr 19, 2024Updated 2 years ago
- ☆16Oct 5, 2022Updated 3 years ago
- Repo for our research paper "Learning Acoustic Scattering Fields for Dynamic Interactive Sound Propagation"☆17Apr 6, 2021Updated 5 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago
- ☆31Dec 6, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/☆15Oct 15, 2018Updated 7 years ago
- ☆27Feb 18, 2025Updated last year
- Personal website☆16Feb 20, 2026Updated 4 months ago
- negamax AI algorithm for turn-based games☆13Oct 6, 2019Updated 6 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆12Aug 17, 2021Updated 4 years ago
- Audio-Visual Room Impulse Response Estimation☆24Jul 22, 2024Updated last year
- An implementation of SVM+☆26Sep 22, 2016Updated 9 years ago
- to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550☆14Nov 15, 2024Updated last year
- Simple yet Powerfull Sapi 4/5 TTS Reader☆14Mar 24, 2019Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Official PyTorch implementation of 'Blind Room Impulse Response Identification via Reverberant Speech Spectrum Reconstruction' [Interspee…☆34Jun 4, 2026Updated 3 weeks ago
- ☆19Apr 1, 2020Updated 6 years ago
- Frequency-Space Neural Scene Representations for FMCW Radar☆62Sep 11, 2024Updated last year
- ☆11Mar 24, 2021Updated 5 years ago
- Dummy project to test your Open3D build☆10May 6, 2021Updated 5 years ago
- The first chapters of an online textbook to support the 3F8 Inference course.☆20Jan 2, 2019Updated 7 years ago
- Ruby sound generator (Using PortAudio)☆16Jul 23, 2013Updated 12 years ago