apple / ml-nvas3d
☆46 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for ml-nvas3d
- Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark ☆37 · Updated 2 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆66 · Updated last week
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆42 · Updated 2 months ago
- Code for Novel View Acoustic Synthesis paper ☆44 · Updated last year
- A large synthetic dataset of spatial audio with multiple labels ☆92 · Updated last year
- Repo for Visual Acoustic Matching, CVPR 2022 ☆65 · Updated last year
- Code for paper Learning Audio-Visual Dereverberation ☆26 · Updated 2 years ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models". ☆31 · Updated 2 months ago
- Codebase and project page for EDMSound ☆29 · Updated last year
- ☆61 · Updated 7 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion. ☆18 · Updated 2 months ago
- ☆40 · Updated 5 months ago
- This is the official implementation of reverberant speech to room impulse response estimator ☆19 · Updated 3 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network ☆69 · Updated last month
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆31 · Updated 10 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv… ☆42 · Updated 8 months ago
- AudioSR-Upsampling (any -> 48kHz) ☆38 · Updated 9 months ago
- ☆23 · Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation… ☆57 · Updated last year
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models) ☆32 · Updated last month
- Viterbi decoding in PyTorch ☆27 · Updated last month
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation ☆32 · Updated this week
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS ☆35 · Updated last year
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022) ☆14 · Updated last year
- ☆25 · Updated 3 months ago
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS. ☆22 · Updated last year
- ☆79 · Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆57 · Updated 2 months ago
- A unified model for zero-shot singing voice conversion and synthesis ☆21 · Updated last year