alvinliu0 / Visual-Sound-Localization-in-the-Wild
Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).
☆29Updated 2 years ago
Related projects
Alternatives and complementary repositories for Visual-Sound-Localization-in-the-Wild
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639)☆32Updated last year
- A curated list of neural rendering resources.☆44Updated 3 years ago
- A practice for million-scale multi-domain universal object detection☆22Updated 4 months ago
- Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.☆42Updated 6 months ago
- A curated list of papers & open source codes about diffusion models☆8Updated 3 years ago
- Official PyTorch implementation of MultiSiam in ICCV 2021 (https://arxiv.org/abs/2108.12178)☆22Updated 3 years ago
- Monocular depth estimation from a single image☆21Updated 4 years ago
- ☆38Updated 3 years ago
- 🔥Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024)☆34Updated 3 weeks ago
- PyTorch re-implementation of Hierarchical Normalization for Robust Monocular Depth Estimation☆14Updated last year
- [ICCV 2023] Learning Fine-Grained Features for Pixel-wise Video Correspondences☆17Updated 8 months ago
- [NeurIPS 2022 Spotlight] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator☆30Updated 2 years ago
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work…☆23Updated 3 months ago
- Which fellows cited my article?☆22Updated 2 years ago
- This is the official PyTorch implementation for the paper: Instance Similarity Learning for Unsupervised Feature Representation.☆21Updated 3 years ago
- Video Panoptic Segmentation☆16Updated 4 years ago
- Sora Generates Videos with Stunning Geometrical Consistency☆46Updated 7 months ago
- ☆32Updated 2 years ago
- The code of 'The devil is in the labels: Semantic segmentation from sentences'.☆2Updated 4 months ago
- Code release for "From Image Collections to Point Clouds with Self-supervised Shape and Pose Networks" (CVPR 2020)☆22Updated 4 years ago
- The Official Implementation of Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose [NIPS 2021](https://ar…☆20Updated 2 years ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆18Updated 6 months ago
- CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation☆13Updated 9 months ago
- Code for paper Background Prompting for Improved Object Depth☆29Updated last year
- [CVPR 2022] Understanding 3D Object Articulation in Internet Videos☆28Updated 8 months ago
- A simple interactive visualization toolkit for MVS that works on server without X11.☆13Updated 3 years ago
- Code for recreating the HoS benchmark of VISOR☆19Updated last year
- Code for Contrastive-Geometry Networks for Generalized 3D Pose Transfer☆21Updated 2 years ago