dieKarotte/ASAudio

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dieKarotte/ASAudio)

dieKarotte / ASAudio

☆59

Alternatives and similar repositories for ASAudio

Users that are interested in ASAudio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MRSAudio / MRSAudio_Main
View on GitHub
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43Oct 15, 2025Updated 9 months ago
dieKarotte / Spatial-Omni
View on GitHub
☆27Jun 17, 2026Updated last month
zszheng147 / Spatial-AST
View on GitHub
🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
☆87Feb 13, 2025Updated last year
jaeyeonkim99 / visage
View on GitHub
Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)
☆47Sep 10, 2025Updated 10 months ago
Ruiqi-Yan / Awesome-Audio-Editing
View on GitHub
A curated list of models, benchmarks, tools and guides for audio editing
☆33Jul 7, 2026Updated 2 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wilkinghoff / DSpAST
View on GitHub
Code for the paper "DSpAST: Disentangled Representations for Spatial Audio Reasoning with Large Language Models"
☆17Oct 23, 2025Updated 8 months ago
BASHLab / OWL
View on GitHub
☆15May 25, 2026Updated last month
Jinbo-Hu / SELD-Data-Generator
View on GitHub
Data generator for sound event localization and detection clips, including 4-ch microphone-array-format signals and first-order-ambisonic…
☆22Nov 13, 2024Updated last year
QxLabIreland / Binamix
View on GitHub
A Python Library for Binaural Mixing and Data Generation
☆56Jan 23, 2026Updated 5 months ago
taishi-n / torchrir
View on GitHub
PyTorch-based room impulse response (RIR) simulation toolkit with dynamic scenes, GPU acceleration.
☆22Feb 18, 2026Updated 5 months ago
PeiwenSun2000 / Both-Ears-Wide-Open
View on GitHub
The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
☆65Jul 2, 2025Updated last year
danielkrause / Moving-Binaural-SDEL
View on GitHub
Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"
☆22Mar 2, 2025Updated last year
liuhuadai / OmniAudio
View on GitHub
[ICML 2025] PyTorch Implementation of "OmniAudio: Generating Spatial Audio from 360-Degree Video"
☆374Jun 27, 2025Updated last year
xiquan-li / Awesome-Audio-Generation
View on GitHub
Curated list for papers, codes and resources related to Text-to-Audio (TTA) Generation
☆74May 27, 2026Updated last month
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
HuMathe / sonoworld
View on GitHub
Official implementation of the CVPR 2026 paper "SonoWorld: From One Image to a 3D Audio-Visual Scene."
☆39Jul 6, 2026Updated 2 weeks ago
IoSR-Surrey / IoSR_ListeningRoom_BRIRs
View on GitHub
The IoSR listening room multichannel BRIR dataset contains binaural room impulse responses measured at head angles of 0 to 360 degrees in…
☆22Mar 24, 2017Updated 9 years ago
sh01k / KernelInterpSpatialANC
View on GitHub
Spatial active noise control based on kernel interpolation of sound field
☆15Mar 30, 2023Updated 3 years ago
GLJS / audio-datasets
View on GitHub
GitHub Repository for the Survey Paper on Audio-Language Datasets for Scenes and Events
☆17Feb 7, 2025Updated last year
sonalkum / MMAUPro
View on GitHub
Official repo for MMAU-Pro Benchmark
☆22Sep 25, 2025Updated 9 months ago
kszpxxzmc / ViSAudio
View on GitHub
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
☆117Dec 11, 2025Updated 7 months ago
donghoney0416 / DeepASA
View on GitHub
Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"
☆26Apr 15, 2026Updated 3 months ago
Katarina-Poole / Spatial-Audio-Metrics
View on GitHub
Spatial Audio Metrics (SAM) is a toolbox to analyse spatial audio and spatial audio perceptual experiments
☆37May 16, 2026Updated 2 months ago
facebookresearch / A2B
View on GitHub
A2B Neural Rendering of Ambisonic Recordings to Binaural
☆20Aug 5, 2025Updated 11 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
CookiePPP / podcast_rss_feeds
View on GitHub
List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.
☆31Apr 13, 2023Updated 3 years ago
vlsi-nanocomputing / dynamic-sound
View on GitHub
DynamicSound Simulator is a modular Python library for generating virtual acoustic scenes with configurable microphones, sound sources, a…
☆18Updated this week
james34602 / Neural-Network-Quadraphonic-Upmix
View on GitHub
The simplest way to demix stereo content with decent quality and low latency.
☆19Apr 11, 2019Updated 7 years ago
jingkangqi / DSENet
View on GitHub
☆34Feb 19, 2025Updated last year
fschmid56 / PretrainedSED
View on GitHub
☆144May 13, 2025Updated last year
R1ckShi / FrontEnd-AEC
View on GitHub
Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.
☆19Apr 22, 2019Updated 7 years ago
jin-woo-lee / nfs-binaural
View on GitHub
☆13Aug 13, 2023Updated 2 years ago
partha2409 / DCASE2025_seld_baseline
View on GitHub
☆27May 27, 2025Updated last year
Jinbo-Hu / PSELDNets
View on GitHub
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
☆46Sep 17, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ZhikangNiu / Semantic-VAE
View on GitHub
[INTERSPEECH 2026 Oral]Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"
☆120Jun 21, 2026Updated 3 weeks ago
sarulab-speech / SpatialCLAP
View on GitHub
☆19Oct 9, 2025Updated 9 months ago
QxLabIreland / Binaspect
View on GitHub
A Python Library for Full Reference Binaural Fidelity Testing, Visualization & Feature Generation
☆30Oct 30, 2025Updated 8 months ago
InternLM / StarBench
View on GitHub
[ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"
☆42Apr 19, 2026Updated 3 months ago
VikasTokala / BCCTN
View on GitHub
☆33Jun 10, 2025Updated last year
SmartSoundKAIST / 6DRIR-DL
View on GitHub
6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid
☆17Aug 31, 2023Updated 2 years ago
ga642381 / Spoken-Dialogue-Model-Survey
View on GitHub
A survey of spoken dialogue models (SDMs) with speech input and speech output. Focus on their Intermediate Representation and Generation …
☆30Mar 24, 2026Updated 3 months ago