alvinliu0 / Visual-Sound-Localization-in-the-Wild

Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).

☆29

Alternatives and similar repositories for Visual-Sound-Localization-in-the-Wild

Users who are interested in Visual-Sound-Localization-in-the-Wild are comparing it to the repositories listed below.

vLAR-group / UnsupObjSeg
🔥Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024)
☆36 Updated 9 months ago
tgxs002 / wikiscenes
Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.
☆43 Updated last year
zzw-zwzhang / Awesome-of-Neural-Rendering
A curated list of neural rendering resources.
☆44 Updated 3 years ago
TerenceCYJ / 4DContrast
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding, ECCV 2022
☆17 Updated 3 years ago
hyungkwonko / awesome-diffusion-models
A curated list of papers & open source codes about diffusion models
☆9 Updated 3 years ago
krantiparida / beyond-image-to-depth
☆38 Updated 4 years ago
wang-chen / lgl-feature-matching
Lifelong Graph Learning (CVPR 2022) [Feature Matching]
☆31 Updated 3 years ago
RuojinCai / ExtremeRotation_code
Extreme Rotation Estimation using Dense Correlation Volumes
☆46 Updated 2 years ago
minghanz / DepthC3D
Monocular depth estimation from a single image
☆21 Updated 5 years ago
wyndwarrior / autoregressive-bbox
☆17 Updated 2 years ago
EPFL-VILAB / omnidata-paper-code-dump
☆20 Updated 3 years ago
KaiChen1998 / MultiSiam
Official PyTorch implementation of MultiSiam in ICCV 2021 (https://arxiv.org/abs/2108.12178)
☆22 Updated 3 years ago
YangLiu14 / Open-World-Tracking
Official code for "Opening up Open World Tracking" (CVPR 2022)
☆56 Updated 2 years ago
alexanderjaus / PPS
Wild Panoramic Panoptic Segmentation dataset
☆14 Updated 2 years ago
imelekhov / HNDesc
A CNN-based local image descriptor
☆23 Updated 3 years ago
OmidPoursaeed / Self_supervised_Learning_Point_Clouds
Self-supervised Learning of Point Clouds via Orientation Estimation (3DV 2020)
☆16 Updated 4 years ago
VITA-Group / Simple3D-Former
[Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…
☆62 Updated 2 years ago
germain-hug / NeurHal
Visual Correspondence Hallucination: Towards Geometric Reasoning (Under Review)
☆29 Updated 2 years ago
facebookresearch / imu2clip
Code repository for IMU2CLIP (https://arxiv.org/pdf/2210.14395.pdf)
☆93 Updated last year
multimodallearning / flownet3d.pytorch
PyTorch Implementation of FlowNet3D (https://arxiv.org/pdf/1806.01411.pdf)
☆15 Updated 5 years ago
RaduAlexandru / lattice_net
Fast Point Cloud Segmentation Using Permutohedral Lattices
☆16 Updated 2 years ago
liuzhengzhe / 3D-to-2D-Distillation-for-Indoor-Scene-Parsing
CVPR 2021 Oral https://arxiv.org/abs/2104.02243
☆47 Updated last year
cassiePython / MNeuEdit
Code for Mesh-Guided Neural Implicit Field Editing.
☆19 Updated last year
bpiyush / rotation-equivariant-lfm
Rotation equivariance meets local feature matching
☆18 Updated 2 years ago
ChristophReich1996 / Optical-Flow-Visualization-PyTorch
PyTorch implementation of the classical optical flow visualization by Baker et al. [ICCV 2007].
☆39 Updated 3 years ago
jianglongye / implicit-tracking
Online Adaptation for Implicit Object Tracking and Shape Reconstruction in the Wild, RA-L 2022
☆31 Updated last year
JasonQSY / Associative3D
[ECCV 2020] Associative3D: Volumetric Reconstruction from Sparse Views
☆34 Updated 3 years ago
zhuhu00 / Paper-Daily-Notice
Get the papers you want from ArXiv every weekday.
☆25 Updated 2 years ago
kkaiwwana / MVPbev
[ACM MM24 Poster] Official implementation of paper "MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllabili…
☆19 Updated 5 months ago
rpSebastian / gs-cite-fellow
Which fellows cited my article?
☆24 Updated 3 years ago