SAGNIKMJR / few-shot-rir
Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)
☆14 Updated last year
Alternatives and similar repositories for few-shot-rir:
Users who are interested in few-shot-rir are comparing it to the repositories listed below
- Repo for Visual Acoustic Matching, CVPR 2022 ☆66 Updated 2 years ago
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound" ☆23 Updated 2 years ago
- Code for the paper "Learning Audio-Visual Dereverberation" ☆26 Updated 2 years ago
- Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark ☆42 Updated 6 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆19 Updated last year
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models) ☆44 Updated last month
- ☆47 Updated 7 months ago
- (Hybrid) BYOL-S feature extractor using the serab-byols package in PyTorch. ☆27 Updated 10 months ago
- Public code for the paper "MAE-AST: Masked Autoencoding Audio Spectrogram Transformer" ☆87 Updated 2 years ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners" ☆30 Updated last year
- ☆39 Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆36 Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆46 Updated 6 months ago
- COLA contrastive pre-training method implemented in PyTorch ☆43 Updated 4 years ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆64 Updated 6 months ago
- SRTNet ☆24 Updated last year
- PyTorch implementation of the paper "Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training" ☆42 Updated 2 months ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023) ☆9 Updated last year
- ☆82 Updated last year
- ☆28 Updated 2 years ago
- Source code for the paper 'Audio Captioning Transformer' ☆53 Updated 3 years ago
- [NeurIPS'24 Spotlight] Official code for "Acoustic Volume Rendering for Neural Impulse Response Fields" ☆28 Updated 2 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation… ☆64 Updated last year
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement" ☆27 Updated 2 years ago
- Accompanying code for our paper "Point Cloud Audio Processing" ☆19 Updated 3 years ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning) ☆28 Updated 8 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆27 Updated 2 months ago
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods ☆19 Updated 2 years ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models". ☆39 Updated 6 months ago