apple / ml-nvas3dLinks
☆47Updated 11 months ago
Alternatives and similar repositories for ml-nvas3d
Users that are interested in ml-nvas3d are comparing it to the libraries listed below
Sorting:
- A large synthetic dataset of spatial audio with multiple labels☆111Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated 3 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆68Updated 2 years ago
- Viterbi decoding in PyTorch☆34Updated last month
- Speech enhancement in noisy and reverberant environments using deep neural networks☆21Updated this week
- Codebase and project page for EDMSound☆34Updated last year
- ☆15Updated last year
- A GPU accelerated and torch based audio DSP library☆84Updated last week
- Code for paper Learning Audio-Visual Dereverberation☆29Updated 2 years ago
- small audio language model for reasoning☆64Updated 2 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆75Updated last month
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆54Updated 4 months ago
- A list of datasets made available by members of the Aalto Acoustics Lab☆22Updated 9 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆34Updated last month
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆21Updated last month
- Da - ECHO - RetrievAl - daTasEt☆26Updated 11 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- [Neurips'24 Spotlight] Official code for "Acoustic Volume Rendering for Neural Impulse Response Fields"☆37Updated 5 months ago
- Audio-Visual Room Impulse Response Estimation☆17Updated 11 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆45Updated 3 months ago
- Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark☆48Updated 9 months ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆17Updated 10 months ago
- Official implementation for FlowSep☆52Updated 5 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆24Updated 9 months ago
- ☆67Updated last year
- This is the official implementation of reverberant speech to room impulse response estimator☆33Updated 10 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆47Updated last month
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆94Updated 6 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 10 months ago