IFICL/SLfM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IFICL/SLfM)

IFICL / SLfM

Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation

☆43

Alternatives and similar repositories for SLfM

Users that are interested in SLfM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

IFICL / stereocrw
View on GitHub
Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation
☆28Mar 15, 2023Updated 3 years ago
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
stoneMo / EZ-VSL
View on GitHub
Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
☆41Oct 2, 2022Updated 3 years ago
bingo-todd / WaveLoc
View on GitHub
End-to-End binaural sound localization
☆17Feb 27, 2020Updated 6 years ago
HS-YN / PanoAVQA
View on GitHub
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16Oct 12, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cvlab-columbia / paperbot
View on GitHub
PaperBot: Learning to Design Real-World Tools Using Paper
☆13Mar 15, 2024Updated 2 years ago
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
hxixixh / mix-and-localize
View on GitHub
☆23Mar 20, 2024Updated 2 years ago
SAGNIKMJR / move2hear-active-AV-separation
View on GitHub
Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)
☆16Jun 17, 2026Updated last month
v-iashin / SparseSync
View on GitHub
Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)
☆56Jan 29, 2024Updated 2 years ago
tiangeluo / ShapeCompiler
View on GitHub
A Unified Framework for Transforming between Text, Point Cloud, and Program
☆19Jul 3, 2025Updated last year
hohsiangwu / rethinking-visual-sound-localization
View on GitHub
Official implementation of the paper How to Listen? Rethinking Visual Sound Localization
☆18Apr 25, 2022Updated 4 years ago
lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
WikiChao / Ego-AV-Loc
View on GitHub
[CVPR 2023] Egocentric Audio-Visual Object Localization
☆27Jan 6, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
KawhiZhao / Egocentric-Audio-Visual-Speaker-Localization
View on GitHub
Code for paper Audio Visual Speaker Localization from EgoCentric Views
☆11Jul 3, 2024Updated 2 years ago
OpenNLPLab / FNAC_AVL
View on GitHub
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…
☆29Apr 10, 2023Updated 3 years ago
XYPB / CondFoleyGen
View on GitHub
Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".
☆93Dec 8, 2023Updated 2 years ago
epic-kitchens / epic-sounds-annotations
View on GitHub
Splits for epic-sounds dataset
☆85Aug 2, 2025Updated 11 months ago
jinlinyi / 3DFIRES
View on GitHub
[CVPR 2024] 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces
☆26Mar 28, 2024Updated 2 years ago
yuhanghe01 / Sound3DVDet
View on GitHub
Code for WACV24 work for multiview acoustic-visual detection
☆13Mar 22, 2024Updated 2 years ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
hche11 / Localizing-Visual-Sounds-the-Hard-Way
View on GitHub
Localizing Visual Sounds the Hard Way
☆84Jul 6, 2022Updated 4 years ago
5yearsKim / beamforming
View on GitHub
implementing beamforming algorithm in C++
☆11Jan 9, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zihuixue / seeAoT
View on GitHub
Code and data release for the paper "Seeing the Arrow of Time in Large Multimodal Models"
☆16Oct 2, 2025Updated 9 months ago
YYX666660 / LAVSS
View on GitHub
Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
☆19Feb 25, 2025Updated last year
IFICL / images-that-sound
View on GitHub
Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions
☆252Updated this week
GenjiB / LAVISH
View on GitHub
Vision Transformers are Parameter-Efficient Audio-Visual Learners
☆106Aug 11, 2023Updated 2 years ago
facebookresearch / sound-spaces
View on GitHub
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple task…
☆468Sep 29, 2023Updated 2 years ago
ardasnck / learning_to_localize_sound_source
View on GitHub
Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes
☆102Dec 4, 2024Updated last year
SMILE-data / SMILE
View on GitHub
SMILE: A Multimodal Dataset for Understanding Laughter
☆13Jun 15, 2023Updated 3 years ago
facebookresearch / viewseg
View on GitHub
Code for "Recognizing Scenes from Novel Viewpoints"
☆29Sep 16, 2022Updated 3 years ago
florianHofherr / PhysParamInference
View on GitHub
☆19Jan 30, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / visual-acoustic-matching
View on GitHub
Repo for Visual Acoustic Matching, CVPR 2022
☆71Feb 28, 2023Updated 3 years ago
MohammadYehya / Quantum_Edge_Detection
View on GitHub
Quantum Edge Detection using different Quantum Encoding strategies which include Flexible Representation of Quantum Images (FRQI), Novel …
☆16Feb 15, 2025Updated last year
bingo-todd / GCC-PHAT_DNN_Loc
View on GitHub
DNN based binaural sound localization model, using GCC-PHAT as features
☆22Jun 13, 2023Updated 3 years ago
ekazakos / auditory-slow-fast
View on GitHub
Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch
☆73Sep 27, 2021Updated 4 years ago
dkurzend / ClipClap-GZSL
View on GitHub
Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models
☆23Apr 15, 2024Updated 2 years ago
pritamqu / CrissCross
View on GitHub
[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
☆26Jul 11, 2023Updated 3 years ago
GeWu-Lab / Stepping-Stones
View on GitHub
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18Oct 11, 2024Updated last year