alvinliu0 / Visual-Sound-Localization-in-the-Wild
Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).
☆29Updated 3 years ago
Alternatives and similar repositories for Visual-Sound-Localization-in-the-Wild
Users that are interested in Visual-Sound-Localization-in-the-Wild are comparing it to the libraries listed below
Sorting:
- ☆10Updated 10 months ago
- ☆38Updated 3 years ago
- Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.☆43Updated last year
- 4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding, ECCV 2022☆17Updated 2 years ago
- This is the official pytorch implementation for the paper: Instance Similarity Learning for Unsupervised Feature Representation.☆21Updated 3 years ago
- Code for Mesh-Guided Neural Implicit Field Editing.☆19Updated last year
- 🔥Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024)☆34Updated 7 months ago
- Official code for "Opening up Open World Tracking" (CVPR 2022)☆56Updated 2 years ago
- PyTorch implementation for DESC - BMVC20 (Oral) & IJCV22☆17Updated 2 years ago
- A curated list of neural rendering resources.☆44Updated 3 years ago
- Official PyTorch implementation of MultiSiam in ICCV 2021 (https://arxiv.org/abs/2108.12178)☆22Updated 3 years ago
- Project page of "Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis from Monocular Image"☆9Updated 11 months ago
- Video Panoptic Segmentation☆16Updated 4 years ago
- ETHSeg: An Amodel Instance Segmentation Network and a Real-world Dataset for X-Ray Waste Inspection (CVPR2022)☆14Updated 2 years ago
- ☆19Updated 4 years ago
- Market-1501 dataset with super-resolution quality☆19Updated 3 years ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆61Updated 2 years ago
- [CVPR 2022] Understanding 3D Object Articulation in Internet Videos☆31Updated last year
- A practice for million-scale multi-domain universal object detection☆27Updated 11 months ago
- Monocular depth estimation from a single image☆21Updated 5 years ago
- Code for Contrastive-Geometry Networks for Generalized 3D Pose Transfer☆21Updated 3 years ago
- ☆11Updated 4 years ago
- ☆20Updated 3 years ago
- A curated list of papers & open source codes about diffusion models☆9Updated 3 years ago
- Training with Product Digital Twins for AutoRetail Checkout☆18Updated last year
- Self-supervised Learning of Point Clouds via Orientation Estimation (3DV 2020)☆16Updated 3 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency☆17Updated 3 years ago
- [NeurIPS 2022 Spotlight] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator☆30Updated 2 years ago
- ☆26Updated 4 years ago
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Updated 2 years ago