JiabenChen / iQuery
[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation
☆66Updated last year
Alternatives and similar repositories for iQuery
Users that are interested in iQuery are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆27Updated last year
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated 2 months ago
- Bidirectional Mapping between Action Physical-Semantic Space☆31Updated 8 months ago
- ☆20Updated last year
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆37Updated last year
- Download scripts and tools for Replay dataset.☆32Updated last year
- ☆32Updated last year
- ☆82Updated 11 months ago
- Hearing Anything Anywhere Code Release☆38Updated 11 months ago
- [ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding☆42Updated 2 years ago
- ☆35Updated last month
- Official implementation of EgoHOD at ICLR 2025☆15Updated 2 months ago
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆33Updated 2 years ago
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆97Updated 2 years ago
- Code release for PianoMotion10M☆82Updated last month
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆24Updated last year
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆31Updated 11 months ago
- [NeurIPS‘24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention☆25Updated 6 months ago
- ☆61Updated 2 years ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 6 months ago
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆19Updated last month
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆58Updated 8 months ago
- ☆17Updated 10 months ago
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆69Updated 5 months ago
- A Chrome/Edge extension to help you quickly scan through the flood of daily ArXiv papers.☆14Updated last month
- ☆21Updated 9 months ago
- For Ego4D VQ3D Task☆19Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆86Updated last year
- Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTo…☆12Updated 2 years ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆27Updated last year