SAGNIKMJR / ego-AV-spatial-correspondence
[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'
☆13Jun 16, 2024Updated last year
Alternatives and similar repositories for ego-AV-spatial-correspondence
Users who are interested in ego-AV-spatial-correspondence are comparing it to the repositories listed below
- ☆21Feb 15, 2022Updated 4 years ago
- ☆33Apr 10, 2023Updated 2 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 2 years ago
- Official implementation for CIGN☆17Sep 11, 2023Updated 2 years ago
- Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence☆19Jun 14, 2024Updated last year
- ☆19May 19, 2024Updated last year
- ☆22Mar 20, 2024Updated last year
- Compress conventional Vision-Language Pre-training data☆53Sep 22, 2023Updated 2 years ago
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆24Aug 12, 2022Updated 3 years ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆26Jan 6, 2024Updated 2 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset☆59Nov 23, 2020Updated 5 years ago
- Official Implementation of "Open-Vocabulary Audio-Visual Semantic Segmentation" [ACM MM 2024 Oral].☆35Nov 2, 2024Updated last year
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound☆139Mar 28, 2025Updated 10 months ago
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆79Aug 26, 2025Updated 5 months ago
- [CVPR 2023 Highlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆33Aug 30, 2023Updated 2 years ago
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆31Dec 4, 2024Updated last year
- Code implementation for our ECCV, 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos"☆34Feb 5, 2024Updated 2 years ago
- ☆30Jun 14, 2022Updated 3 years ago
- ☆35Jun 6, 2023Updated 2 years ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆48May 1, 2023Updated 2 years ago
- ☆36Jul 9, 2025Updated 7 months ago
- ☆27Jan 9, 2026Updated last month
- ☆10Apr 28, 2023Updated 2 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 5, 2026Updated last week
- [ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).☆162Apr 5, 2023Updated 2 years ago
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos"☆40Dec 15, 2020Updated 5 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year
- Unofficial PyTorch implementation of MapNet: An Allocentric Spatial Memory for Mapping Environments☆12Jun 4, 2020Updated 5 years ago
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations".☆22Jan 23, 2026Updated 3 weeks ago
- ☆19Jul 22, 2025Updated 6 months ago
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated last year
- ☆13Nov 28, 2021Updated 4 years ago
- ☆13May 21, 2024Updated last year
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆26Apr 27, 2025Updated 9 months ago
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch.☆10Oct 27, 2023Updated 2 years ago
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated last week
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆18Jan 26, 2025Updated last year
- The code for On Robust Cross-View Consistency in Outdoor Self-Supervised Monocular Depth Estimation☆13Jun 2, 2023Updated 2 years ago