JiabenChen / iQueryLinks
[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation
☆66Updated last year
Alternatives and similar repositories for iQuery
Users that are interested in iQuery are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆27Updated last year
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated 3 months ago
- For Ego4D VQ3D Task☆20Updated last year
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆19Updated last year
- [ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding☆43Updated 2 years ago
- Bidirectional Mapping between Action Physical-Semantic Space☆31Updated 9 months ago
- Official implementation of EgoHOD at ICLR 2025☆18Updated 3 months ago
- Download scripts and tools for Replay dataset.☆32Updated last year
- ☆20Updated last year
- Code release for PianoMotion10M☆82Updated 2 months ago
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆33Updated 2 years ago
- ☆17Updated 11 months ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆39Updated last year
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆59Updated 9 months ago
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆97Updated 2 years ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆104Updated 6 months ago
- ☆31Updated last year
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆24Updated last year
- [CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…☆25Updated 2 years ago
- This is a third party implementation of the paper "The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective".☆9Updated 2 months ago
- [CVPR2023]Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning☆18Updated 2 years ago
- A Chrome/Edge extension to help you quickly scan through the flood of daily ArXiv papers.☆14Updated 2 months ago
- SAAVN Code release for paper "Sound Adversarial Audio-Visual Navigation,ICLR2022" (In PyTorch)☆19Updated 2 years ago
- [NeurIPS‘24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention☆24Updated 6 months ago
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆54Updated last year
- ☆61Updated 2 years ago
- ☆81Updated this week
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆60Updated 2 years ago
- Hearing Anything Anywhere Code Release☆40Updated 11 months ago
- ☆24Updated 9 months ago