Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds
☆20Dec 18, 2021Updated 4 years ago
Alternatives and similar repositories for binaural-sound-perception
Users that are interested in binaural-sound-perception are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆46Sep 10, 2025Updated 8 months ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆36Feb 15, 2024Updated 2 years ago
- Download scripts and tools for Replay dataset.☆38Jun 23, 2023Updated 2 years ago
- ☆38Jun 29, 2021Updated 4 years ago
- ☆12Dec 7, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Generator for anechoic, non-stationary noise signals☆11Aug 12, 2022Updated 3 years ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆32Feb 13, 2026Updated 3 months ago
- ☆17Jul 23, 2023Updated 2 years ago
- Code for Adaptation Network introduced in "Block-wise Scrambled Image Recognition Using Adaptation Network" paper (AAAI WS 2020)☆12Dec 3, 2019Updated 6 years ago
- Model for selecting perceptually relevant early reflections for parametric spatial sound rendering☆13Oct 26, 2023Updated 2 years ago
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis☆12Oct 3, 2024Updated last year
- The repository provides code for EgoMAN model and dataset creation scripts.☆31Dec 31, 2025Updated 5 months ago
- Extract your SlidesLive presentation.☆15Apr 19, 2024Updated 2 years ago
- Repo for our research paper "Learning Acoustic Scattering Fields for Dynamic Interactive Sound Propagation"☆17Apr 6, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago
- ☆27Feb 18, 2025Updated last year
- Awesome-GenAITech: a curated list of Generative AI Techniques☆11Jul 11, 2023Updated 2 years ago
- Virtual Audio Loopback Cable for Windows☆10Sep 18, 2022Updated 3 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆12Aug 17, 2021Updated 4 years ago
- Acoustic impulse response generation using diffusion models☆76Oct 3, 2023Updated 2 years ago
- 🎓 서울대 컴퓨터공학부 (컴공) 학위 논문 템플릿 | Thesis template for SNU CSE☆19Jan 5, 2026Updated 5 months ago
- Control of the Ball and Wheel System with a state-space controller.☆11Feb 27, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550☆14Nov 15, 2024Updated last year
- Simple yet Powerfull Sapi 4/5 TTS Reader☆14Mar 24, 2019Updated 7 years ago
- Official PyTorch implementation of 'Blind Room Impulse Response Identification via Reverberant Speech Spectrum Reconstruction' [Interspee…☆33Updated this week
- ☆19Apr 1, 2020Updated 6 years ago
- Frequency-Space Neural Scene Representations for FMCW Radar☆62Sep 11, 2024Updated last year
- The first chapters of an online textbook to support the 3F8 Inference course.☆20Jan 2, 2019Updated 7 years ago
- Ruby sound generator (Using PortAudio)☆16Jul 23, 2013Updated 12 years ago
- DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image (ICLR 2025)☆24Jan 12, 2026Updated 4 months ago
- Primal-Dual Solver for Inverse Problems☆15Mar 24, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆17Jul 23, 2024Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 3 years ago
- A binaural audio unit.☆10Dec 8, 2014Updated 11 years ago
- ☆13May 16, 2021Updated 5 years ago
- NASH 2021 project... this may or may not end up working 🤷♂️☆12Dec 19, 2021Updated 4 years ago
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…☆10Dec 16, 2017Updated 8 years ago
- ☆24Jun 12, 2024Updated last year