JiabenChen / iQuery
[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation
☆65Updated last year
Alternatives and similar repositories for iQuery:
Users that are interested in iQuery are comparing it to the libraries listed below
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆27Updated last year
- Hearing Anything Anywhere Code Release☆38Updated 10 months ago
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆33Updated 2 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆37Updated last year
- Download scripts and tools for Replay dataset.☆32Updated last year
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆31Updated 11 months ago
- ☆34Updated last week
- Code release for PianoMotion10M☆77Updated 3 weeks ago
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆19Updated 2 weeks ago
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆18Updated last year
- ☆20Updated last year
- ☆21Updated 8 months ago
- Bidirectional Mapping between Action Physical-Semantic Space☆31Updated 7 months ago
- ☆32Updated last year
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated 2 months ago
- [CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…☆24Updated 2 years ago
- Official implementation for MGN☆20Updated 2 years ago
- [ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding☆42Updated 2 years ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆24Updated last year
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆97Updated 2 years ago
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis☆10Updated 6 months ago
- ☆24Updated 2 years ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆16Updated 2 months ago
- Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)☆14Updated 3 years ago
- Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark☆47Updated 7 months ago
- ☆45Updated 9 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 5 months ago
- ☆17Updated 10 months ago
- Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTo…☆12Updated 2 years ago
- ☆61Updated last year