dieKarotte / ASAudio
☆39Updated 3 months ago
Alternatives and similar repositories for ASAudio
Users that are interested in ASAudio are comparing it to the libraries listed below
- Official data preparation scripts for the URGENT 2024 Challenge☆87Updated 8 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆45Updated 3 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆92Updated 5 months ago
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆73Updated 11 months ago
- Official repository for FlowSE (Interspeech 2025)☆85Updated 7 months ago
- The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]☆140Updated this week
- (ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement☆87Updated 6 months ago
- Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverb…☆29Updated 3 months ago
- Query-conditioned target sound extraction model☆30Updated 10 months ago
- Unofficial PyTorch implementation of SoundStream with training code and a 16kHz pretrained checkpoint☆76Updated 3 weeks ago
- Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.☆32Updated 2 months ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆150Updated 9 months ago
- ☆130Updated 2 weeks ago
- AnyEnhance-based Baseline for the CCF-AATC 2025 Challenge Track 1☆43Updated last month
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆58Updated 7 months ago
- ☆61Updated 2 years ago
- Official repository of a new SOTA diffusion-model-based method for speech enhancement☆41Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆79Updated 8 months ago
- A Python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆39Updated last year
- Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection☆22Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆55Updated 9 months ago
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…☆27Updated 2 months ago
- A 6-million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline☆194Updated last year
- Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding☆19Updated 9 months ago
- ☆14Updated last year
- ☆120Updated 2 years ago
- ☆35Updated last year
- Implementation of SpatialCodec.☆68Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2022 challenge☆60Updated 3 years ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆45Updated 10 months ago