alvinliu0 / Visual-Sound-Localization-in-the-WildLinks
Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).
☆29Updated 3 years ago
Alternatives and similar repositories for Visual-Sound-Localization-in-the-Wild
Users that are interested in Visual-Sound-Localization-in-the-Wild are comparing it to the libraries listed below
Sorting:
- 4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding, ECCV 2022☆17Updated 2 years ago
- ☆10Updated 10 months ago
- 🔥Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024)☆35Updated 7 months ago
- ETHSeg: An Amodel Instance Segmentation Network and a Real-world Dataset for X-Ray Waste Inspection (CVPR2022)☆14Updated 2 years ago
- Official PyTorch implementation of MultiSiam in ICCV 2021 (https://arxiv.org/abs/2108.12178)☆22Updated 3 years ago
- Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.☆43Updated last year
- Monocular depth estimation from a single image☆21Updated 5 years ago
- This is the official pytorch implementation for the paper: Instance Similarity Learning for Unsupervised Feature Representation.☆21Updated 3 years ago
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆35Updated 2 years ago
- ☆13Updated 2 months ago
- ☆38Updated 3 years ago
- A curated list of neural rendering resources.☆44Updated 3 years ago
- CVPR 2021 Oral https://arxiv.org/abs/2104.02243☆47Updated last year
- Project page of "Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis from Monocular Image"☆9Updated 11 months ago
- Official code for "Opening up Open World Tracking" (CVPR 2022)☆56Updated 2 years ago
- A practice for million-scale multi-domain universal object detection☆27Updated 11 months ago
- ☆20Updated 3 years ago
- This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection.☆14Updated last year
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated 2 years ago
- A curated list of papers & open source codes about diffusion models☆9Updated 3 years ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆18Updated last year
- Wild Panoramic Panoptic Segmentation dataset☆14Updated 2 years ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆61Updated 2 years ago
- ☆19Updated last year
- [ICCV 2023] Learning Fine-Grained Features for Pixel-wise Video Correspondences☆17Updated last year
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work…☆25Updated 10 months ago
- ☆19Updated 4 years ago
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…☆20Updated last year
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆58Updated last year
- [NeurIPS 2022 Spotlight] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator☆30Updated 2 years ago